What is Reinforcement Learning from Human Feedback?
Reinforcement learning from human feedback (RLHF) trains AI systems to generate text or take actions that align with human preferences. It has become one of the central methods for fine-tuning large language models; in particular, it was a key component in training GPT-4, Claude, Bard, and the LLaMA-2 chat models. RLHF