deep-learning

Using graph neural networks to recommend related products

Recommending related products (say, a phone case to go along with a new phone) is a fundamental capability of e-commerce sites, one that saves customers time and leads to more satisfying shopping experiences.

Building makemore Part 3: Activations & Gradients, BatchNorm

We dive into some of the internals of MLPs with multiple layers and scrutinize the statistics of the forward pass activations, backward pass gradients, and some of the pitfalls when they are improperly scaled. We also look at the typical diagnostic tools and visualizations you’d want to use to understand the health of your deep network. We learn why training deep neural nets can be fragile and introduce the first modern innovation that made doing so much easier: Batch Normalization.
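To make the scaling problem concrete, here is a minimal sketch of the batch normalization idea in plain Python. It is not the code from the linked lecture: the function names, the choice of the population (biased) batch variance, and the synthetic inputs are all assumptions made for illustration.

```python
import math
import random


def batch_norm(batch, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalize each feature (column) of `batch` to roughly zero mean and unit variance.

    `batch` is a list of rows; `gamma` and `beta` stand in for the learnable
    scale and shift (kept as plain scalars in this sketch).
    """
    n = len(batch)
    n_features = len(batch[0])
    out = [[0.0] * n_features for _ in range(n)]
    for j in range(n_features):
        column = [row[j] for row in batch]
        mean = sum(column) / n
        # Assumption: population (biased) variance computed over the batch.
        var = sum((x - mean) ** 2 for x in column) / n
        inv_std = 1.0 / math.sqrt(var + eps)
        for i in range(n):
            out[i][j] = gamma * (column[i] - mean) * inv_std + beta
    return out


def column_stats(batch, j):
    """Return (mean, standard deviation) of feature j across the batch."""
    column = [row[j] for row in batch]
    mean = sum(column) / len(column)
    std = math.sqrt(sum((x - mean) ** 2 for x in column) / len(column))
    return mean, std


random.seed(0)
# Synthetic, badly scaled pre-activations: a large spread would saturate tanh.
pre_activations = [[random.gauss(5.0, 20.0) for _ in range(4)] for _ in range(256)]
normalized = batch_norm(pre_activations)

print("before:", column_stats(pre_activations, 0))
print("after: ", column_stats(normalized, 0))
```

Comparing the two printed lines is a tiny version of the diagnostics the lecture describes: inspect per-layer statistics first, then check that normalization brings them back to a well-behaved range.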

CS197 Harvard: AI Research Experiences

Once we go from training one model to training hundreds of different models with different hyperparameters, we need to start organizing. We’re going to break down our organization into three pieces: experiment tracking, hyperparameter search, and configuration setup.
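As a rough illustration of how those three pieces fit together, the sketch below uses only the Python standard library. The `Config` fields, the `train_and_evaluate` placeholder, and the grid values are hypothetical and not taken from the course; a real setup would typically swap in a dedicated tracking and configuration tool.

```python
import itertools
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class Config:
    """Configuration setup: one immutable object per run (hypothetical fields)."""
    learning_rate: float
    hidden_size: int
    seed: int = 0


def train_and_evaluate(config):
    """Placeholder for a real training run; returns a made-up validation loss."""
    return (config.learning_rate - 0.01) ** 2 + 1.0 / config.hidden_size


def grid_search(learning_rates, hidden_sizes):
    """Hyperparameter search: try every combination and record each run."""
    runs = []
    for lr, hidden in itertools.product(learning_rates, hidden_sizes):
        config = Config(learning_rate=lr, hidden_size=hidden)
        loss = train_and_evaluate(config)
        # Experiment tracking: store the config next to its metric.
        runs.append({"config": asdict(config), "val_loss": loss})
    return runs


runs = grid_search([0.001, 0.01, 0.1], [32, 64])
best = min(runs, key=lambda run: run["val_loss"])
print(json.dumps(best, indent=2))
```

Storing the full configuration alongside each metric is the design choice that matters here: any run can later be reproduced from its record alone.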

How diffusion models work: the math from scratch

Diffusion models are a new class of state-of-the-art generative models that generate diverse high-resolution images. They have already attracted a lot of attention after OpenAI, Nvidia and Google managed to train large-scale models. Example architectures that are based on diffusion models are GLIDE, DALL-E 2, Imagen, and the fully open-source Stable Diffusion.

Spacetimeformer Multivariate Forecasting

Spacetimeformer is a Transformer that learns temporal patterns like a time series model and spatial patterns like a Graph Neural Network.

Transfer learning for Time Series Forecasting

Transfer learning refers to the process of pre-training a flexible model on a large dataset and using it later on other data with little to no training. It is one of the most outstanding 🚀 achievements in Machine Learning 🧠 and has many practical applications.

DALL·E Now Available Without Waitlist

New users can start creating straight away. Lessons learned from deployment and improvements to our safety systems make wider availability possible.

labml.ai Annotated PyTorch Paper Implementations

This is a collection of simple PyTorch implementations of neural networks and related algorithms. These implementations are documented with explanations, and the website renders these as side-by-side formatted notes. We believe these would help you understand these algorithms better.

12 Most Popular NLP Projects of 2022 So Far

Natural Language Processing remains one of the hottest topics of 2022. By using GitHub stars (albeit certainly not the only measure) as a proxy for popularity, we took a look at what NLP projects are getting the most traction so far this year, just as we recently did with machine learning projects. It’s a list with some familiar names but there are plenty of surprises also!

Alpha Connect

Recreating DeepMind’s AlphaZero - AI Plays Connect 4

Best ML Model Registry Tools

A model registry is a central repository that is used to version control Machine Learning (ML) models. It simply tracks the models while they move between training, production, monitoring, and deployment.
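The idea can be sketched with a small in-memory registry. This is an illustrative toy, not the API of any of the tools in the linked article: the class names, the stage names, and the placeholder model name and artifact paths are all hypothetical.

```python
from dataclasses import dataclass, field

# Hypothetical stage names; real registries define their own lifecycle.
STAGES = ("training", "staging", "production", "archived")


@dataclass
class ModelVersion:
    name: str
    version: int
    artifact_uri: str
    stage: str = "training"
    metrics: dict = field(default_factory=dict)


class ModelRegistry:
    """Toy registry: versions each model and tracks its current stage."""

    def __init__(self):
        self._versions = {}  # model name -> list of ModelVersion

    def register(self, name, artifact_uri, metrics=None):
        """Store a new version; version numbers start at 1 and increase."""
        versions = self._versions.setdefault(name, [])
        model_version = ModelVersion(
            name, len(versions) + 1, artifact_uri, metrics=dict(metrics or {})
        )
        versions.append(model_version)
        return model_version

    def transition(self, name, version, stage):
        """Move an existing version to another lifecycle stage."""
        if stage not in STAGES:
            raise ValueError(f"unknown stage: {stage!r}")
        model_version = self._versions[name][version - 1]
        model_version.stage = stage
        return model_version

    def latest(self, name, stage):
        """Return the newest version in the given stage, or None."""
        matches = [v for v in self._versions.get(name, []) if v.stage == stage]
        return matches[-1] if matches else None


registry = ModelRegistry()
first = registry.register("demo-model", "artifacts/demo-model/1")   # placeholder path
second = registry.register("demo-model", "artifacts/demo-model/2")  # placeholder path
registry.transition("demo-model", first.version, "production")

print(registry.latest("demo-model", "production"))
print(registry.latest("demo-model", "training"))
```

Production-grade registries add persistence, access control, and lineage on top of this, but the core bookkeeping (a name, a version number, an artifact location, and a stage) stays the same.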

Best Practices for ML Engineering

This document is intended to help those with a basic knowledge of machine learning get the benefit of Google’s best practices in machine learning. It presents a style for machine learning, similar to the Google C++ Style Guide and other popular guides to practical programming. If you have taken a class in machine learning, or built or worked on a machine-learned model, then you have the necessary background to read this document.

MinImagen - Build Your Own Imagen Text-to-Image Model

Text-to-Image models have made great strides this year, from DALL-E 2 to the more recent Imagen model. In this tutorial, learn how to build a minimal Imagen implementation: MinImagen.

Build a GNN-based real-time fraud detection solution using Amazon SageMaker, Amazon Neptune, and the Deep Graph Library

We focus on four tasks: processing a tabular transaction dataset into a heterogeneous graph dataset; training a GNN model using SageMaker; deploying the trained GNN models as a SageMaker endpoint; and demonstrating real-time inference for incoming transactions.

DALL-E: Inside the Artificial Intelligence program that creates images from textual descriptions


deep-learning

Using graph neural networks to recommend related products

Building makemore Part 3: Activations & Gradients, BatchNorm

CS197 Harvard: AI Research Experiences

How diffusion models work: the math from scratch

Spacetimeformer Multivariate Forecasting

Transfer learning for Time Series Forecasting

DALL·E Now Available Without Waitlist

labml.ai Annotated PyTorch Paper Implementations

12 Most Popular NLP Projects of 2022 So Far

Alpha Connect

Best ML Model Registry Tools

Best Practices for ML Engineering

MinImagen - Build Your Own Imagen Text-to-Image Model

Build a GNN-based real-time fraud detection solution using Amazon SageMaker, Amazon Neptune, and the Deep Graph Library

DALL-E: Inside the Artificial Intelligence program that creates images from textual descriptions