Hacker News: submissions from openpipe.ai
Analyzing OpenAI's Reinforcement Fine-Tuning: Less Data, Better Results (openpipe.ai)
4 points by kcorbitt 22 days ago | past
Using reinforcement learning and $4.80 of GPU time to find the best HN post (openpipe.ai)
217 points by kcorbitt 85 days ago | past | 95 comments
DPO fine-tuning outperforms SFT (openpipe.ai)
1 point by kcorbitt 3 months ago | past
OpenPipe (openpipe.ai)
1 point by handfuloflight 3 months ago | past
Fine-Tuning Best Practices: Models (openpipe.ai)
2 points by gk1 3 months ago | past
Fine-Tuning for Production Apps (openpipe.ai)
2 points by ijidak 4 months ago | past
Fine-Tuning Best Practices Series Introduction and Chapter 1: Training Data (openpipe.ai)
3 points by sebg 4 months ago | past
LLM Fine-Tuning Best Practices: Base Models Proprietary/Open Source, Large/Small (openpipe.ai)
2 points by billmalarky 4 months ago | past | 1 comment
LLM Fine-Tuning Best Practices for Training Data Curation (openpipe.ai)
1 point by billmalarky 5 months ago | past | 2 comments
OpenPipe Mixture of Agents: Outperform GPT-4 at 1/25th the Cost (openpipe.ai)
13 points by kcorbitt 7 months ago | past | 2 comments
What we've learned in 3 days of Llama 3 (openpipe.ai)
3 points by kcorbitt 9 months ago | past
Mixtral Curious? Comparing Mistral 7B and Mixtral for fine-tuning (openpipe.ai)
1 point by kcorbitt 10 months ago | past
S-LoRA: Serving Thousands of Models from One GPU for Fun and Profit (openpipe.ai)
1 point by kcorbitt on Jan 18, 2024 | past
Mistral 7B Fine-Tune Optimized (openpipe.ai)
234 points by tosh on Dec 20, 2023 | past | 103 comments
Is AI the next crypto? Insights from HN comments (openpipe.ai)
237 points by kcorbitt on Nov 8, 2023 | past | 367 comments
