Uncategorized 3 Sep 2025 · 11 min read LongCat-Flash: 560B AI From a Delivery App?! HOLD UP. A 560B Model from... a Food Delivery App?! 🤯 All right team, gather 'round, because the AI world just got WEIRD. A new Chinese model, LongCat-Flash-Chat, is here. Continue reading: LongCat-Flash: 560B AI From a Delivery App?!
Uncategorized 30 Jan 2025 · 4 min read Tulu3: Advanced Open-Source Language Model Post-Training Introduction In the rapidly evolving field of language models, post-training techniques such as instruction tuning, reinforcement learning from human feedback (RLHF), and advanced fine-tuning have emerged as critical methodologies. However, Continue reading: Tulu3: Advanced Open-Source Language Model Post-Training
Uncategorized 27 Nov 2024 · 3 min read OLMo 2: AI2’s Latest Open Language Models That Challenge the Big Names in Generative AI Introduction The Allen Institute for AI (AI2) has introduced OLMo 2, a family of open language models designed to compete directly with industry heavyweights like Qwen and Llama. This launch Continue reading: OLMo 2: AI2’s Latest Open Language Models That Challenge the Big Names in Generative AI