ddelange
💥
["translatio", "imitatio", "aemulatio"]
Stars

etl

Extract-Transform-Load, Data Wrangling, Data Mining, ...
267 repositories

Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow

Rust 365 62 Updated Jul 31, 2024

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Rust 34,989 2,358 Updated Aug 21, 2025

Synthetic Data Generation for mixed-type, multivariate time series.

Python 116 16 Updated Aug 11, 2025

Community maintained fork of pdfminer - we fathom PDF

Python 6,658 992 Updated May 6, 2025

A very fast Python asyncio http and websockets client

Python 167 19 Updated Mar 28, 2025

🍰 Desktop utility to download images/videos/music/text from various websites, and more.

Python 25,885 2,341 Updated Feb 1, 2025

Python wrapper of the RuRe.

Rust 90 13 Updated Oct 30, 2019

An Elasticsearch client exposing DataFrame API

Python 283 46 Updated Apr 1, 2023

A blazingly fast JSON serializing & deserializing library

Go 8,516 410 Updated Aug 21, 2025

Super minimal python S3 cache

Python 4 Updated Sep 6, 2019

A web interface to extract tabular data from PDFs

Python 1,701 242 Updated Jan 3, 2025

A validation library for Pandas data frames using user-friendly schemas

Python 192 36 Updated Mar 24, 2023

A toolkit to run Ray applications on Kubernetes

Go 1,997 604 Updated Aug 21, 2025

Python Serverless Microframework for AWS

Python 10,908 1,004 Updated May 29, 2025

A cloud-native Pipeline resource.

Go 8,736 1,834 Updated Aug 22, 2025

AintQ Is Not Task Queue - a Python asyncio task queue on PostgreSQL.

Python 50 3 Updated Dec 26, 2022

An open source python library for automated feature engineering

Python 7,522 903 Updated Aug 21, 2025

An analytics database that puts JSON and relational tables on equal footing

Go 1,489 70 Updated Aug 21, 2025

Normalizing Flows in JAX 🌊

Python 289 17 Updated Jun 18, 2023

Extract data from websites using basic statistical magic

Python 505 41 Updated Oct 2, 2020

Dockerfile for libpostal-service based on the Who's on First implementation

Dockerfile 38 14 Updated May 29, 2025

A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀

Python 345 116 Updated Jul 31, 2025

Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes, price drops, restock alerts, and website defacement monito…

Python 26,261 1,456 Updated Aug 21, 2025

An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

Python 837 204 Updated Aug 20, 2025

Keep code, data, containers under control with git and git-annex

Python 599 114 Updated Aug 18, 2025

An interactive PDF reader.

Python 429 58 Updated Jul 19, 2023

Represent, send, store and search multimodal data

Python 3,095 233 Updated Jun 17, 2025

Python tools for geographic data

Python 4,860 981 Updated Aug 18, 2025

Identify bias and measure fairness of your data

Python 95 10 Updated Aug 11, 2025

Leave One Feature Out Importance

Python 836 86 Updated Feb 14, 2025