The open source ELT framework powered by Apache Arrow
-
Updated
Aug 9, 2025 - Go
The open source ELT framework powered by Apache Arrow
Serverless multi-protocol + multi-destination event collection system.
A Content Discovery and Development Platform. Empowering Cybersecurity, AI, Marketing, and Finance professionals and researchers to discover, analyze, and interact with the web in all its dimensions.
rtdl makes it easy to build and maintain a real-time data lake
Simple, RESTful Log Collector
Sistema para escalonamento e orquestração de execuções, visando a automatização de processos do DadosJusBR
CloudQuery Provider for Scaleway
DataDigger is a powerful and intuitive web application designed to extract and analyze data from web pages.
high performance better alternative to Airbyte, Singer, Meltano
A command-line tool in Go that extracts meaningful text from web pages, filters out unwanted elements, and outputs clean text for easy integration with AI applications, data mining, and web scraping.
VRChat database to store uploaded data from mods
Web scraping API written in Golang for business data collection.
Helper library to provide event handlers to gpioc-dev library.
The Ethereum Gas Price Extractor is a Go program designed to extract a sample of gas price values from successful Ethereum transactions for every block using the Etherscan and Infura APIs.
Add a description, image, and links to the data-collection topic page so that developers can more easily learn about it.
To associate your repository with the data-collection topic, visit your repo's landing page and select "manage topics."