| | The State of Reasoning Models (sebastianraschka.com) |
| 4 points by sbbq 8 months ago | past |
|
| | The State of LLM Reasoning Models Part 1: Inference-Time Compute Scaling Methods (sebastianraschka.com) |
| 3 points by yaiml 9 months ago | past |
|
| | Understanding Reasoning LLMs (sebastianraschka.com) |
| 473 points by sebg 10 months ago | past | 183 comments |
|
| | Understanding Reasoning LLMs (sebastianraschka.com) |
| 4 points by sbbq 10 months ago | past |
|
| | Noteworthy LLM Research Papers of 2024 Megapost (sebastianraschka.com) |
| 5 points by yaiml 10 months ago | past |
|
| | Implementing a Byte Pair Encoding (BPE) Tokenizer from Scratch (sebastianraschka.com) |
| 2 points by headalgorithm 10 months ago | past |
|
| | Implementing a Byte Pair Encoding (BPE) Tokenizer from Scratch (sebastianraschka.com) |
| 4 points by sbbq 10 months ago | past |
|
| | AI Research Recap 2024: From New Scaling Laws to Scaling Inference Compute (sebastianraschka.com) |
| 1 point by sbbq 10 months ago | past |
|
| | Noteworthy AI Research Papers of 2024 (Part One) (sebastianraschka.com) |
| 1 point by birdculture 11 months ago | past |
|
| | Noteworthy AI Research Papers of 2024 (Part One) (sebastianraschka.com) |
| 1 point by sbbq 11 months ago | past |
|
| | Collection of 1k LLM Research Papers of 2024 (sebastianraschka.com) |
| 4 points by sbbq 11 months ago | past |
|
| | LLM Research Papers: The 2024 List (sebastianraschka.com) |
| 5 points by ModelForge 11 months ago | past |
|
| | LLM Research Papers: The 2024 List (sebastianraschka.com) |
| 1 point by mdp2021 12 months ago | past |
|
| | Understanding Multimodal LLMs (sebastianraschka.com) |
| 2 points by lapnect on Nov 4, 2024 | past |
|
| | Understanding Multimodal LLMs: The Main Techniques and Latest Models (sebastianraschka.com) |
| 4 points by sbbq on Nov 3, 2024 | past |
|
| | Building a GPT-Style LLM Classifier from Scratch (sebastianraschka.com) |
| 2 points by mdp2021 on Sept 21, 2024 | past |
|
| | Building LLMs from the Ground Up: A 3-Hour Coding Workshop (sebastianraschka.com) |
| 970 points by mdp2021 on Aug 31, 2024 | past | 136 comments |
|
| | Show HN: New LLM Pre-Training and Post-Training Paradigms (sebastianraschka.com) |
| 2 points by rasbt on Aug 21, 2024 | past |
|
| | New LLM Pre-Training and Post-Training Paradigms: How Modern LLMs Are Trained (sebastianraschka.com) |
| 5 points by sbbq on Aug 17, 2024 | past |
|
| | Developing an LLM: Building, Training, Finetuning (sebastianraschka.com) |
| 1 point by Anon84 on June 13, 2024 | past |
|
| | Understanding the LLM Development Cycle: Building, Training, Finetuning (sebastianraschka.com) |
| 3 points by rasbt on June 8, 2024 | past |
|
| | The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM (sebastianraschka.com) |
| 5 points by rasbt on May 12, 2024 | past |
|
| | Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com) |
| 2 points by sbbq on April 2, 2024 | past |
|
| | Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com) |
| 2 points by tosh on April 1, 2024 | past |
|
| | Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com) |
| 1 point by Anon84 on March 31, 2024 | past |
|
| | Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com) |
| 2 points by rasbt on March 31, 2024 | past |
|
| | AI Research in Feb 2024 – LoRA Successor, "Small" LLMs, Transparent LLM Research (sebastianraschka.com) |
| 3 points by rasbt on March 3, 2024 | past |
|
| | Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch (sebastianraschka.com) |
| 96 points by rasbt on Feb 18, 2024 | past | 10 comments |
|
| | AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs (sebastianraschka.com) |
| 20 points by rasbt on Feb 3, 2024 | past |
|
| | Naive Bayes and Text Classification I – Introduction and Theory (2014) (sebastianraschka.com) |
| 2 points by vikrum on Jan 22, 2024 | past |
|