PinnedPublished inTDS ArchiveA long-term Data Science roadmap which WON’T help you become an expert in only several monthsFrom time to time I’m asked: how does one become Data scientist? What courses are necessary? How long will it take? How did you become DS…Dec 2, 201819Dec 2, 201819
Two Years of Studying and Practicing Foreign LanguagesMy experience of studying and practicing foreign languages: Spanish, German, Japanese1d ago1d ago
Paper Review: NeoBERT: A Next-Generation BERTNeoBERT: a next-gen encoder with 4K tokens context length that outperforms larger models like RoBERTa on MTEB!3d ago3d ago
Paper Review: SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding…Visual Language Encoder with Improved Semantic Understanding, Localization, and Dense Features!Feb 24Feb 24
Paper Review: Goku: Flow Based Video Generative Foundation ModelsJoint image and video generation using rectified flow Transformers!Feb 17Feb 17
Paper Review: Titans: Learning to Memorize at Test TimeTitans: Transformers with attention for short-term memory and neural memory module for long-term memory!Feb 3Feb 3
Paper Review: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement LearningLLM with reasoning via Reinforcement LearningJan 27Jan 27
Paper Review: STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video…Improved Video-Super resolution to deal with artifacts and improve fidelity!Jan 13Jan 13
Paper Review: Training Large Language Models to Reason in a Continuous Latent SpaceLLM reasoning: Chain of Continuous Thought instead of Chain-of-Thought!Jan 6Jan 6
Paper Review: Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory…Modern BERT: with modern design, better training, and more data!Dec 23, 2024Dec 23, 2024