PinnedPublished inTDS ArchiveA long-term Data Science roadmap which WON’T help you become an expert in only several monthsFrom time to time I’m asked: how does one become Data scientist? What courses are necessary? How long will it take? How did you become DS…Dec 2, 201819Dec 2, 201819
Paper Review: Titans: Learning to Memorize at Test TimeTitans: Transformers with attention for short-term memory and neural memory module for long-term memory!Feb 3Feb 3
Paper Review: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement LearningLLM with reasoning via Reinforcement LearningJan 27Jan 27
Paper Review: STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video…Improved Video-Super resolution to deal with artifacts and improve fidelity!Jan 13Jan 13
Paper Review: Training Large Language Models to Reason in a Continuous Latent SpaceLLM reasoning: Chain of Continuous Thought instead of Chain-of-Thought!Jan 6Jan 6
Paper Review: Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory…Modern BERT: with modern design, better training, and more data!Dec 23, 2024Dec 23, 2024
Paper Review: Byte Latent Transformer: Patches Scale Better Than TokensLLM trained not on the tokens but on the bytes! New efficiency unlocked!Dec 16, 2024Dec 16, 2024
Paper Review: Reverse Thinking Makes LLMs Stronger ReasonersTraining student LLM by making teacher LLM generate questions, forward reasoning, backward questions, and backward reasoningDec 9, 2024Dec 9, 2024
Paper Review: Project Sid: Many-agent simulations toward AI civilizationMany-agent simulations for collaborationNov 25, 2024Nov 25, 2024
Paper Review: Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster…LLM performing at Kaggle Grandmaster Level! Or maybe not.Nov 11, 2024Nov 11, 2024