Nebiyou Hailemariam Reading

Articles

Illustrating Reinforcement Learning from Human Feedback (RLHF)
Comprehensive guide to RLHF: using reinforcement learning to optimize language models with human feedback.
Plan-and-Execute (LangGraph)
Tutorial on building a plan-and-execute style agent in LangGraph.
Part 3: Intro to Policy Optimization (Spinning Up)
Policy gradient derivation, reward-to-go, baselines, and advantage-based policy gradients.
Project: Deep Agents
Certificate in building deep agents with LangChain.
Foundation: Introduction to LangChain - Python
Certificate in LangChain foundations for building applications with LLMs using Python.
Foundation: Introduction to LangGraph
Certificate in LangGraph foundations for building stateful, multi-actor applications with LLMs.
Training and Finetuning Reranker Models with Sentence Transformers v4
March 26, 2025
Guide to training and finetuning reranker models with Sentence Transformers v4.
Mastering RAG: How to Select a Reranking Model
Galileo AI
Guide to selecting reranking models for RAG systems.
Introduction to Recommender Systems: Content-Based, Collaborative Filtering, and Hybrid Recommendation Engines
Alpha Quantum
Introduction to recommender systems: content-based, collaborative filtering, and hybrid approaches.
Reverse-mode automatic differentiation from scratch, in Python
June 11, 2020
Building a minimal reverse-mode autodiff framework from scratch in Python.
A Practical Guide to Contrastive Learning
July 30, 2024
Building SimSiam models with FashionMNIST for self-supervised learning.

Books

Research Papers
