Posts

The Architectures Powering Machine Learning at Google, Facebook, Uber, LinkedIn

The challenge of establishing reference architectures for large-scale machine learning solutions is accentuated by two main factors. First, machine learning frameworks and infrastructure have evolved considerably faster than the adoption of those technologies in mainstream environments. Second, the lifecycle of machine learning solutions is fundamentally different from that of other software disciplines.
What is Metaflow

Metaflow is a human-friendly Python library that helps scientists and engineers build and manage real-life data science projects. Metaflow was originally developed at Netflix to boost the productivity of data scientists who work on a wide variety of projects from classical statistics to state-of-the-art deep learning. Metaflow provides a unified API to the infrastructure stack that is required to execute data science projects, from prototype to production.
A powerful and flexible machine learning platform for drug discovery

TorchDrug is a machine learning platform designed for drug discovery. It covers techniques ranging from graph machine learning (graph neural networks, geometric deep learning, and knowledge graphs) and deep generative models to reinforcement learning. It provides a comprehensive and flexible interface to support rapid prototyping of drug discovery models in PyTorch.
KG Course 2021

A course on knowledge graphs and how to work with them in 2021. Taught in Russian.
PyG (PyTorch Geometric) is a library built upon PyTorch to easily write and train Graph Neural Networks (GNNs) for a wide range of applications related to structured data.

PyG implements various methods for deep learning on graphs and other irregular structures, also known as geometric deep learning, drawn from a variety of published papers. In addition, it provides easy-to-use mini-batch loaders for operating on many small graphs or a single giant graph, multi-GPU support, distributed graph learning via Quiver, a large number of common benchmark datasets (with simple interfaces for creating your own), the GraphGym experiment manager, and helpful transforms for learning on arbitrary graphs as well as on 3D meshes and point clouds.
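Mini-batching over many small graphs can be illustrated without PyG itself. The following is a minimal pure-Python sketch of the general idea: several small graphs are merged into one disjoint graph by offsetting node indices, and a batch vector records which graph each node came from. It is not PyG's actual implementation; the function name `collate_graphs` and the dictionary layout are assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical layout: {"x": per-node feature lists, "edges": [(src, dst), ...]}
Graph = Dict[str, list]


def collate_graphs(graphs: List[Graph]) -> Graph:
    """Merge small graphs into one disjoint graph (illustrative only)."""
    x: list = []
    edges: List[Tuple[int, int]] = []
    batch: List[int] = []  # batch[i] = index of the graph that node i came from
    offset = 0
    for graph_id, g in enumerate(graphs):
        num_nodes = len(g["x"])
        x.extend(g["x"])
        # Shift node indices so each edge still points at its own graph's nodes.
        edges.extend((src + offset, dst + offset) for src, dst in g["edges"])
        batch.extend([graph_id] * num_nodes)
        offset += num_nodes
    return {"x": x, "edges": edges, "batch": batch}


g1 = {"x": [[1.0], [2.0]], "edges": [(0, 1)]}
g2 = {"x": [[3.0], [4.0], [5.0]], "edges": [(0, 2), (1, 2)]}
merged = collate_graphs([g1, g2])
print(merged["edges"])
print(merged["batch"])
```

Because the merged graphs share no edges, message passing on the combined graph never mixes information between the original graphs, which is why a single forward pass can process the whole batch.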
Are You Still Using Pandas to Process Big Data in 2021?

Dask provides advanced parallelism for analytics, enabling performance at scale for familiar tools such as NumPy, pandas, and scikit-learn. It is open source and freely available, and it reuses existing Python APIs and data structures, which makes it easy to switch between those libraries and their Dask-powered equivalents. Vaex is a high-performance Python library for lazy, out-of-core DataFrames (similar to pandas) for visualizing and exploring large tabular datasets. According to the project, it can calculate basic statistics for more than a billion rows per second.
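The out-of-core idea behind such libraries can be sketched with the standard library alone: rows are streamed in fixed-size chunks and only running aggregates are kept in memory, so the full dataset never has to fit in RAM. This is a simplified illustration of the principle, not the Dask or Vaex API; `iter_chunks`, `chunked_mean`, and the `price` column are hypothetical names.

```python
import csv
import io
from typing import Dict, Iterable, Iterator, List


def iter_chunks(rows: Iterable[Dict[str, str]], chunk_size: int) -> Iterator[List[Dict[str, str]]]:
    """Yield lists of at most chunk_size rows without materializing all rows."""
    chunk: List[Dict[str, str]] = []
    for row in rows:
        chunk.append(row)
        if len(chunk) == chunk_size:
            yield chunk
            chunk = []
    if chunk:  # final, possibly smaller, chunk
        yield chunk


def chunked_mean(rows: Iterable[Dict[str, str]], column: str, chunk_size: int = 2) -> float:
    """Compute the mean of a column while holding only one chunk in memory."""
    total = 0.0
    count = 0
    for chunk in iter_chunks(rows, chunk_size):
        total += sum(float(row[column]) for row in chunk)
        count += len(chunk)
    if count == 0:
        raise ValueError("no rows to aggregate")
    return total / count


# An in-memory stand-in for a large CSV file on disk.
data = io.StringIO("price\n10\n20\n30\n40\n50\n")
mean_price = chunked_mean(csv.DictReader(data), "price")
print(mean_price)
```

Real out-of-core libraries add lazy task graphs, parallel execution, and memory mapping on top of this pattern, but the running-aggregate structure is the same.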
Building a Knowledge Graph for Job Search Using BERT

In this tutorial, we build a job recommendation and skill discovery script that takes unstructured text as input and outputs job recommendations and skill suggestions based on entities such as skills, years of experience, diploma, and major. We extract entities and relations from job descriptions using a BERT model and attempt to build a knowledge graph from skills and years of experience.
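The graph-building and recommendation steps can be sketched as follows, assuming the entity and relation extraction has already happened. The triples, relation names, and function names below are hypothetical placeholders standing in for the extractor's output, not the tutorial's actual data or code.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set, Tuple

Triple = Tuple[str, str, str]  # (head entity, relation, tail entity)

# Hypothetical triples standing in for the output of the extraction step.
triples: List[Triple] = [
    ("Data Scientist", "REQUIRES_SKILL", "Python"),
    ("Data Scientist", "REQUIRES_SKILL", "SQL"),
    ("Data Scientist", "REQUIRES_EXPERIENCE", "3 years"),
    ("ML Engineer", "REQUIRES_SKILL", "Python"),
    ("ML Engineer", "REQUIRES_SKILL", "PyTorch"),
    ("Data Analyst", "REQUIRES_SKILL", "SQL"),
]


def build_graph(items: Iterable[Triple]) -> Dict[str, Dict[str, Set[str]]]:
    """Index triples as graph[head][relation] -> set of tail entities."""
    graph: Dict[str, Dict[str, Set[str]]] = defaultdict(lambda: defaultdict(set))
    for head, relation, tail in items:
        graph[head][relation].add(tail)
    return graph


def recommend_jobs(graph, skills: Set[str]) -> List[Tuple[str, int]]:
    """Rank jobs by how many required skills the candidate already has."""
    scores = []
    for job, relations in graph.items():
        overlap = len(relations.get("REQUIRES_SKILL", set()) & skills)
        if overlap:
            scores.append((job, overlap))
    # Highest overlap first; ties broken alphabetically for stable output.
    return sorted(scores, key=lambda item: (-item[1], item[0]))


def suggest_skills(graph, job: str, skills: Set[str]) -> List[str]:
    """List skills the job requires that the candidate does not have yet."""
    return sorted(graph[job].get("REQUIRES_SKILL", set()) - skills)


graph = build_graph(triples)
candidate_skills = {"Python", "SQL"}
print(recommend_jobs(graph, candidate_skills))
print(suggest_skills(graph, "ML Engineer", candidate_skills))
```

Ranking by simple skill overlap is a deliberately naive choice here; a fuller system could also weight years of experience or other extracted entities.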