libraries

Databolt Pipeline

Databolt Pipeline

0
d6tpipe is a python library which makes it easier to exchange data files. It’s like git for data! But better because you can include it in your data science code. Link Documentation
Example Usage For a Machine Learning Workflow - Databolt Flow

Example Usage For a Machine Learning Workflow - Databolt Flow

0
For data scientists and data engineers, d6tflow is a python library which makes building complex data science workflows easy, fast and intuitive. It is primarily designed for data scientists to build better models faster. For data engineers, it can also be a lightweight alternative and help productionize data science models faster. Unlike other data pipeline/workflow solutions, d6tflow focuses on managing data science research workflows instead of managing production data pipelines.
data-diff

data-diff

0
data-diff is a command-line tool and Python library to efficiently diff rows across two different databases. Link
Hasura GraphQL Engine

Hasura GraphQL Engine

0
Hasura is an open source product that accelerates API development by 10x by giving you GraphQL or REST APIs with built in authorization on your data, instantly. Link
Polars DataFrame library

Polars DataFrame library

0
This book is an introduction to the Polars DataFrame library. Its goal is to introduce you to Polars by going through examples and comparing it to other solutions. Some design choices are introduced here. The guide will also introduce you to optimal usage of Polars. Link
smart_open — utils for streaming large files in Python

smart_open — utils for streaming large files in Python

0
smart_open is a Python 3 library for efficient streaming of very large files from/to storages such as S3, GCS, Azure Blob Storage, HDFS, WebHDFS, HTTP, HTTPS, SFTP, or local filesystem. It supports transparent, on-the-fly (de-)compression for a variety of different formats. Link
SberProcessMining (SberPM) – Process Mining Python framework

SberProcessMining (SberPM) – Process Mining Python framework

0
SberPM is an open-source Python library for conducting a comprehensive analysis of business processes with the use of process mining and machine learning techniques. By implementing this tool, objective and deep insights into the process on all levels can be revealed. These insights are then used to detect problems such as bottlenecks and deviations and identify potential opportunities for process improvement and optimization. Link Example
5 Python Libraries for Time-Series Analysis

5 Python Libraries for Time-Series Analysis

0
A Time-Series is a sequence of data points collected at different timestamps. These are essentially successive measurements collected from the same data source at the same time interval. Further, we can use these chronologically gathered readings to monitor trends and changes over time. The time-series models can be univariate or multivariate. The univariate time series models are implemented when the dependent variable is a single time series, like room temperature measurement from a single sensor.