Search: [big-data] - Biapy Web Directory

Trunk Data Platform (TDP) https://www.trunkdataplatform.io/

Mon Nov 27 12:01:20 2023

📧email

open source big data platform.

Trunk Data Platform is an Open Source, free, Hadoop distribution.

Apache Druid https://druid.apache.org/

Thu Aug 24 10:11:00 2023

📧email

Druid is a high performance, real-time analytics database that delivers sub-second queries on streaming and batch data at scale and under load.

XetHub: fast, frictionless collaboration at scale https://xethub.com/

Wed Jan 4 13:46:49 2023

📧email

XetHub brings speedy access and Git-based collaboration to large scale repositories of data, code, or any combination of files.
Our instant mount feature makes it possible to access GBs and TBs of data in seconds at the speed of localhost, while our de-duplication algorithm stores data and differences efficiently to save money and speed up development cycles.
XetHub is ideal for teams who already use Git to track their code changes, and want to leverage the power of infinite history, pull requests, and difference-based tracking for larger assets such as datasets or media files. Managing complete projects with familiar Git semantics makes change tracking and continuous integration a breeze, especially for workflows that use code to generate or augment assets.

Neo4j https://neo4j.com/

Fri Oct 21 09:11:59 2022

📧email

Graph Database Management System.
Neo4j Graph Data Platform. Blazing-Fast Graph, Petabyte Scale.
With proven trillion+ entity performance, developers, data scientists, and enterprises rely on Neo4j as the top choice for high-performance, scalable analytics, intelligent app development, and advanced AI/ML pipelines.

ClickHouse https://github.com/ClickHouse/ClickHouse

Fri Oct 14 08:48:20 2022

📧email

ClickHouse® is a free analytics DBMS for big data.
ClickHouse® is an open-source column-oriented database management system that allows generating analytical data reports in real-time.

Konbert https://konbert.com/

Mon Oct 10 08:27:23 2022

📧email

Open big JSON, CSV Files: Online Viewer, Explorer and Converter.
View and convert big data files.
View large or small files right in your browser and export them in any format.

Planet https://www.planet.com/

Wed Feb 2 10:36:38 2022

📧email

Daily Earth Data to See Change and Make Better Decisions.
Planet provides daily satellite data that helps businesses, governments, researchers, and journalists understand the physical world and take action.

Climate TRACE https://www.climatetrace.org/

Fri Jan 28 12:31:53 2022

📧email

Climate TRACE was built to collect and share greenhouse gas emissions from anthropogenic (human) activities to facilitate climate action .

Robtex https://www.robtex.com/

Tue Jan 25 09:55:01 2022

📧email

Robtex is used for various kinds of research of IP numbers, Domain names, etc.
Robtex uses various sources to gather public information about IP numbers, domain names, host names, Autonomous systems, routes etc. It then indexes the data in a big database and provide free access to the data.
We aim to make the fastest and most comprehensive free DNS lookup tool on the Internet.
Our database now contains billions of documents of internet data collected over more than a decade.

Luna https://www.luna-lang.org/

Tue Aug 7 22:27:45 2018

📧email

A WYSIWYG language for data processing.

EveryPolitician http://everypolitician.org/

Sat Dec 31 02:38:26 2016

📧email

Political data for 233 countries.
The world’s richest open dataset on politicians

PNDA http://pndaproject.io/

Thu Jul 14 17:10:51 2016

📧email

The scalable, open source
big data analytics platform
for networks and services.

ROOT a Data analysis Framework https://root.cern.ch/

Thu Apr 28 18:52:48 2016

📧email

A modular scientific software framework. It provides all the functionalities needed to deal with big data processing, statistical analysis, visualisation and storage. It is mainly written in C++ but integrated with other languages such as Python and R.

OpenGrid https://github.com/Chicago/opengrid

Sat Mar 19 01:09:42 2016

📧email

A user-friendly, map-based tool to combine and explore real-time or historical data.

WhereHows https://github.com/linkedin/WhereHows

Mon Mar 7 07:55:56 2016

📧email

WhereHows is a data discovery and lineage tool built at LinkedIn. It integrates with all the major data processing systems and collects both catalog and operational metadata from them.

GridDB https://github.com/griddb/griddb_nosql

Mon Feb 29 23:56:51 2016

📧email

high performance, high scalability and high reliability database for big data.
GridDB has a KVS (Key-Value Store)-type data model that is suitable for sensor data stored in a timeseries. It is a database that can be easily scaled-out according to the number of sensors.

HBase - Apache HBase™ Home http://hbase.apache.org/

Wed Mar 5 22:05:57 2014

📧email

Apache HBase™ is the Hadoop database, a distributed, scalable, big data store.
Use Apache HBase when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. Apache HBase is an open-source, distributed, versioned, non-relational database modeled after Google's Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, Apache HBase provides Bigtable-like capabilities on top of Hadoop and HDFS.

D3.js - Data-Driven Documents http://d3js.org/

Wed Mar 5 10:58:31 2014

📧email

D3.js is a JavaScript library for manipulating documents based on data. D3 helps you bring data to life using HTML, SVG and CSS. D3’s emphasis on web standards gives you the full capabilities of modern browsers without tying yourself to a proprietary framework, combining powerful visualization components and a data-driven approach to DOM manipulation.

Waarp https://github.com/Waarp

Fri Jun 22 08:34:43 2012

📧email

Waarp provides a secure and efficient open source MFT solution.

Apache™ Hadoop™! http://hadoop.apache.org/

Wed Mar 21 17:21:29 2012

📧email

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-avaiability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-availabile service on top of a cluster of computers, each of which may be prone to failures.

Links per page

Filters