DeepSeek in LLM Leaderboard vs o1

News

o1, as it is called ... models that Chinese firms are in the vanguard: in December DeepSeek published a new large language model (LLM), a form of AI that analyses and generates text.

OS X Daily · 2d

How to Run DeepSeek LLM Locally on Mac

If you follow AI news, or even tech news, you might have heard of DeepSeek by now, the powerful Chinese Large Language Model ...

DeepSeek unveils new technique for smarter, scalable AI reward models

Reward models holding back AI? DeepSeek's SPCT creates self-guiding critiques, promising more scalable intelligence for ...

EurekAlert! · 4d

DeepSeek’s impact on thoracic surgeons’ work patterns—past, present and future

On January 27, 2025, the release of the new open-source large language model (LLM), DeepSeek, caused a global sensation. Humans have been working on developing artificial intelligence (AI) capable of ...

GIGAZINE · 5d

DeepSeek and Tsinghua University Researchers Announce New Method to Enhance LLM Inference Capabilities

Researchers at DeepSeek, a Chinese AI startup that develops DeepSeek-R1 and other AI apps, have developed a new approach to improve the inference capabilities of general large-scale language ...

Computer Weekly · 4d · Opinion

DeepSeek will help evolve the conversation around privacy

The rise of DeepSeek has prompted the usual well-documented concerns around AI, but also raised worries about its potential ...

DeepSeek-GRM: Introducing an Enhanced AI Reasoning Technique

Researchers from DeepSeek and Tsinghua University say combining two techniques improves the answers the large language model ...

Unite.AI · 5d

Bespoke LLMs for Every Business? DeepSeek Shows Us the Way

Once upon a time, the tech clarion call was “cellphones for everyone” – and indeed mobile communications have revolutionized business (and the world). Today, the equivalent of that call is to give ...

Nvidia’s new Llama-3.1 Nemotron Ultra outperforms DeepSeek R1 at half the size

Compared to DeepSeek R1, Llama-3.1-Nemotron-Ultra-253B shows competitive results despite having less than half the parameters.

Anadolu Ajansi · 4d

DeepSeek introduces new method for enhancing reasoning abilities of large language models

Reward modeling is a process used to align an LLM’s behavior with human preferences. DeepSeek plans to make its GRM models open source, the researchers said, although no specific timeline was given.
