Enhancing AI Alignment with Direct Preference Optimization
Posted on Wed 14 February 2024 in deep learning • 3 min read

Aligning models with human values and preferences is a formidable task, particularly when training on vast datasets that resist meticulous curation. Concerns arise when language models generate content that is false or harmful. Fortunately, advances in AI research are opening pathways toward safer and more reliable interactions …