| | Mercury 2: Best-in-class speed-optimized intelligence at 1,200 tok/sec (twitter.com/artificialanlys) | | 1 point by volodia 3 months ago | past | |
| | Grok 4 is now the leading AI model (ArtificialAnlys) (twitter.com/artificialanlys) | | 13 points by JnBrymn 11 months ago | past | 6 comments | |
| | DeepSeek V3 is now the highest scoring non-reasoning model (twitter.com/artificialanlys) | | 14 points by aurareturn on March 25, 2025 | past | 3 comments | |
| | We've now partially replicated Reflection Llama 3.1 70B's eval claims (twitter.com/artificialanlys) | | 4 points by _micah_h on Sept 8, 2024 | past | 1 comment | |
| | Cerebras launches inference for Llama 3.1; benchmarked at 1846 tokens/s on 8B (twitter.com/artificialanlys) | | 95 points by _micah_h on Aug 27, 2024 | past | 42 comments | |
| | SambaNova breaks 1000 tokens/sec on Llama 3 8B (twitter.com/artificialanlys) | | 7 points by germanjoey on May 28, 2024 | past | |
| | From GPT-4 to Mistral 7B, there is a 300x range in the cost of LLM inference (twitter.com/artificialanlys) | | 2 points by Gcam on Feb 12, 2024 | past | |
| | Mistral API reduces time to first token by 10x (only place for Mistral Medium) (twitter.com/artificialanlys) | | 4 points by Gcam on Feb 5, 2024 | past | |
| | 240 tokens/s achieved by Groq's custom chips on Llama 2 Chat (70B) (twitter.com/artificialanlys) | | 5 points by Gcam on Jan 31, 2024 | past | |
| | New GPT-4 Turbo (0125 Preview) slightly faster per initial benchmarks (twitter.com/artificialanlys) | | 2 points by Gcam on Jan 26, 2024 | past | |