GRPO Reward Decline After Convergence in Gemma-3-4B Fine-tuning

PleasantLog5975

1 min ago

All about fine-tuning, LLMs, AI & the Unsloth project!
