Category: neurips

  • Beyond Causal Language Modeling

    Beyond Causal Language Modeling A deep dive into “Not All Tokens Are What You Need for Pretraining” Introduction A few days ago, I had the chance to present at a local reading group that focused on some of the most exciting and insightful papers from NeurIPS 2024. As a presenter, I selected a paper titled…

  • Classifier-free guidance for LLMs performance enhancing

    Classifier-free guidance for LLMs performance enhancing Classifier-Free Guidance for LLMs Performance Enhancing Check and improve classifier-free guidance for text generation large language models. While participating in NeurIPS 2024 Competitions track I was awarded the second prize in the LLM Privacy challenge. The solution I had used classifier-free guidance (CFG). I noticed that with high CFG guidance…