The building blocks behind ChatGPT

Episode 2,   Mar 07, 2023, 06:23 AM

Join Rafael and Marcia as they welcome Matt Kidd, Senior Data Scientist (NLP) from Deeper Insights, for a discussion on InstructGPT, the predecessor of ChatGPT. The discussion centres on the paper "Training language models to follow instructions with human feedback" (2022) authored by the OpenAI team.

Join Rafael and Marcia as they welcome Matt Kidd, Senior Data Scientist (NLP) from Deeper Insights, for a discussion on InstructGPT, the predecessor of ChatGPT. The main discussion revolves around the impact of using alignment techniques, namely  Reinforcement Learning from Human Feedback (RLHF), on the usefulness and widespread use of Large Language Models (LLMs). Centres around the paper "Training language models to follow instructions with human feedback" (2022) authored by the OpenAI team. They cover topics like alignment with human intentions, RLHF and the finer areas of what makes this paper a seminal paper for the generative AI communities. If you are interested in reading the paper and following along please click this link: https://arxiv.org/pdf/2203.02155.pdf 

For more information on all things artificial intelligence, generative AI, machine learning, and engineering for your business please visit www.deeperinsights.com or reach out to us at thepaperclub@deeperinsights.com.