Submissions from openpipe.ai

		Analyzing OpenAI's Reinforcement Fine-Tuning: Less Data, Better Results (openpipe.ai)
		4 points by kcorbitt 22 days ago \| past
		Using reinforcement learning and $4.80 of GPU time to find the best HN post (openpipe.ai)
		217 points by kcorbitt 85 days ago \| past \| 95 comments
		DPO fine-tuning outperforms SFT (openpipe.ai)
		1 point by kcorbitt 3 months ago \| past
		OpenPipe (openpipe.ai)
		1 point by handfuloflight 3 months ago \| past
		Fine-Tuning Best Practices: Models (openpipe.ai)
		2 points by gk1 3 months ago \| past
		Fine-Tuning for Production Apps (openpipe.ai)
		2 points by ijidak 4 months ago \| past
		Fine-Tuning Best Practices Series Introduction and Chapter 1: Training Data (openpipe.ai)
		3 points by sebg 4 months ago \| past
		LLM Fine-Tuning Best Practices: Base Models Proprietary/Open Source, Large/Small (openpipe.ai)
		2 points by billmalarky 4 months ago \| past \| 1 comment
		LLM Fine-Tuning Best Practices for Training Data Curation (openpipe.ai)
		1 point by billmalarky 5 months ago \| past \| 2 comments
		OpenPipe Mixture of Agents: Outperform GPT-4 at 1/25th the Cost (openpipe.ai)
		13 points by kcorbitt 7 months ago \| past \| 2 comments
		What we've learned in 3 days of Llama 3 (openpipe.ai)
		3 points by kcorbitt 9 months ago \| past
		Mixtral Curious? Comparing Mistral 7B and Mixtral for fine-tuning (openpipe.ai)
		1 point by kcorbitt 10 months ago \| past
		S-LoRA: Serving Thousands of Models from One GPU for Fun and Profit (openpipe.ai)
		1 point by kcorbitt on Jan 18, 2024 \| past
		Mistral 7B Fine-Tune Optimized (openpipe.ai)
		234 points by tosh on Dec 20, 2023 \| past \| 103 comments
		Is AI the next crypto? Insights from HN comments (openpipe.ai)
		237 points by kcorbitt on Nov 8, 2023 \| past \| 367 comments