Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
from
login
Analyzing OpenAI's Reinforcement Fine-Tuning: Less Data, Better Results
(
openpipe.ai
)
4 points
by
kcorbitt
22 days ago
|
past
Using reinforcement learning and $4.80 of GPU time to find the best HN post
(
openpipe.ai
)
217 points
by
kcorbitt
85 days ago
|
past
|
95 comments
DPO fine-tuning outperforms SFT
(
openpipe.ai
)
1 point
by
kcorbitt
3 months ago
|
past
OpenPipe
(
openpipe.ai
)
1 point
by
handfuloflight
3 months ago
|
past
Fine-Tuning Best Practices: Models
(
openpipe.ai
)
2 points
by
gk1
3 months ago
|
past
Fine-Tuning for Production Apps
(
openpipe.ai
)
2 points
by
ijidak
4 months ago
|
past
Fine-Tuning Best Practices Series Introduction and Chapter 1: Training Data
(
openpipe.ai
)
3 points
by
sebg
4 months ago
|
past
LLM Fine-Tuning Best Practices: Base Models Proprietary/Open Source, Large/Small
(
openpipe.ai
)
2 points
by
billmalarky
4 months ago
|
past
|
1 comment
LLM Fine-Tuning Best Practices for Training Data Curation
(
openpipe.ai
)
1 point
by
billmalarky
5 months ago
|
past
|
2 comments
OpenPipe Mixture of Agents: Outperform GPT-4 at 1/25th the Cost
(
openpipe.ai
)
13 points
by
kcorbitt
7 months ago
|
past
|
2 comments
What we've learned in 3 days of Llama 3
(
openpipe.ai
)
3 points
by
kcorbitt
9 months ago
|
past
Mixtral Curious? Comparing Mistral 7B and Mixtral for fine-tuning
(
openpipe.ai
)
1 point
by
kcorbitt
10 months ago
|
past
S-LoRA: Serving Thousands of Models from One GPU for Fun and Profit
(
openpipe.ai
)
1 point
by
kcorbitt
on Jan 18, 2024
|
past
Mistral 7B Fine-Tune Optimized
(
openpipe.ai
)
234 points
by
tosh
on Dec 20, 2023
|
past
|
103 comments
Is AI the next crypto? Insights from HN comments
(
openpipe.ai
)
237 points
by
kcorbitt
on Nov 8, 2023
|
past
|
367 comments
Consider applying for YC's Spring batch! Applications are open till Feb 11.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: