Hacker Newsnew | past | comments | ask | show | jobs | submit | readitalready's submissionslogin
1.Training-Free Group Relative Policy Optimization (arxiv.org)
1 point by readitalready 24 days ago | past
2.Retrieval-Aware Distillation for Transformer-SSM Hybrids (arxiv.org)
2 points by readitalready 26 days ago | past
3.HySparse: A Hybrid Sparse Attention Architecture (arxiv.org)
5 points by readitalready 29 days ago | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: