Datasets or data sets.

Gephi is an award-winning open-source platform for visualizing and manipulating large graphs. Fast, powered by a built-in OpenGL… It runs on Windows, Mac OS X and Linux. Localization is available in English, French, Spanish, Japanese, Russian, Brazilian Portuguese, Chinese, Czech, German and Romanian.

Augmentoolkit: Framework to convert raw text into datasets using open-source and closed-source models.

Data Prep Kit: Framework for data preparation for both code and language, with modules in Python, Ray, and Spark, and a wide range of scale from laptops to data centers.

A toolkit for linearizing PDFs for LLM datasets/training, released under the Apache-2.0 license.

🤗 Datasets is a lightweight library. Among its main features are one-line dataloaders for many public datasets: one-liners to download and pre-process any of the major public datasets (image datasets, audio datasets, text datasets in 467 languages and dialects, etc.) provided on the HuggingFace Datasets Hub.

Google Research Datasets: datasets released by Google Research. Google Research Datasets has 174 repositories available. Follow their code on GitHub.

Curated open data has 155 repositories available.

The awesome section presents collections of high-quality datasets organized by topic. The home page for the awesome collections is located in the awesome-data repository on GitHub and should be modified from there.

This repository is designed for students, developers, data analysts, and beginners who need reliable sample data for data analysis, machine learning, AI projects, dashboards, and academic practice. Repository URL: https://github…

Kaggle profile for Samira Alipour.

GitHub is where people build software.
Jan 30, 2026 · Welcome to the Free Public Datasets Repository, a curated collection of open-source datasets available in CSV, XLSX (Excel), and JSON formats. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects. Relevant open data curated.
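
Since the repository described above offers its datasets in CSV and JSON formats, a minimal sketch of reading both with the Python standard library may help. The sample records, column names, and helper functions below are hypothetical placeholders and are not taken from any of the repositories mentioned; XLSX files need a third-party reader and are not shown.

```python
import csv
import io
import json

# Hypothetical sample content standing in for downloaded dataset files.
CSV_TEXT = "city,population\nOslo,700000\nLima,10000000\n"
JSON_TEXT = (
    '[{"city": "Oslo", "population": 700000},'
    ' {"city": "Lima", "population": 10000000}]'
)


def load_csv_records(text):
    """Parse CSV text into a list of dicts, one per row (all values are strings)."""
    return list(csv.DictReader(io.StringIO(text)))


def load_json_records(text):
    """Parse a JSON array of objects into a list of dicts (types are preserved)."""
    return json.loads(text)


csv_rows = load_csv_records(CSV_TEXT)
json_rows = load_json_records(JSON_TEXT)

# CSV carries no type information, so numeric columns must be converted explicitly.
csv_total = sum(int(row["population"]) for row in csv_rows)
json_total = sum(row["population"] for row in json_rows)

print(csv_total, json_total)
```

For a real file, the same helpers could be given the result of `open(path, encoding="utf-8").read()`; the practical difference between the two formats is that CSV values arrive as strings while JSON keeps numbers as numbers.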