Posts

A high-level plotting API for the PyData ecosystem built on HoloViews.

hvPlot is a high-level plotting API built on HoloViews that offers a general, consistent way to plot data in the formats used across the PyData ecosystem. hvPlot can integrate neatly with the individual libraries if an extension mechanism for the native plot APIs is offered, or it can be used as a standalone component. To get started, jump straight into the installation instructions and check out the current functionality in the User Guide.
CS25 I Stanford Seminar

Since their introduction in 2017, transformers have revolutionized Natural Language Processing (NLP). Now, transformers are finding applications all over Deep Learning, be it computer vision (CV), reinforcement learning (RL), Generative Adversarial Networks (GANs), speech, or even biology. Among other things, transformers have enabled the creation of powerful language models like GPT-3 and were instrumental in DeepMind’s recent AlphaFold2, which tackles protein folding.
Databolt Flow

For data scientists and data engineers, d6tflow is a Python library that makes building complex data science workflows easy, fast and intuitive. It is primarily designed for data scientists to build better models faster. For data engineers, it can also be a lightweight alternative and help productionize data science models faster. Unlike other data pipeline/workflow solutions, d6tflow focuses on managing data science research workflows instead of managing production data pipelines.
Databolt Pipeline

d6tpipe is a Python library that makes it easier to exchange data files. It’s like git for data! But better, because you can include it in your data science code.
GPU Puzzles

GPU architectures are critical to machine learning, and seem to be becoming even more important every day. However, you can be an expert in machine learning without ever touching GPU code. It is a bit weird to always work through abstractions.
ML Systems Design Interview Guide

One of the trickiest interview rounds for ML practitioners is ML systems design. If you’re applying to be a Data Scientist, ML Engineer or ML Manager at a big tech company, you’ll probably face an ML systems design question. I recently tackled this question at a few big tech companies on my way to becoming a Staff ML Engineer at Pinterest. In this article I’m going to talk about how to approach ML systems design interviews and the core concepts to know, and I’ll provide links to some of the resources I used.
Templating your SQL Queries Using Jinja on dbt

dbt (data build tool) is a data transformation tool that enables data engineers and analysts to transform and document data. It provides the transformation layer in the ELT (extract-load-transform) process. It also helps data professionals build scalable and maintainable code, just like software engineers.