Datasette
Groovy Datasets for Test Databases - Redis
redis.ioKyle Steinike added
WebDataset
WebDataset is a library for writing I/O pipelines for large datasets. Its sequential I/O and sharding features make it especially useful for streaming large-scale datasets to a DataLoader.
The WebDataset format
A WebDataset file is a TAR archive containing a series of data files. All successive data files with the same prefix are consider... See more
WebDataset is a library for writing I/O pipelines for large datasets. Its sequential I/O and sharding features make it especially useful for streaming large-scale datasets to a DataLoader.
The WebDataset format
A WebDataset file is a TAR archive containing a series of data files. All successive data files with the same prefix are consider... See more
WebDataset
Nicolay Gerold added
Magical tools
for working
with data
Queries, notebooks, reports, data apps, and AI — all in the world’s leading collaborative data workspace.
for working
with data
Queries, notebooks, reports, data apps, and AI — all in the world’s leading collaborative data workspace.
Hex - Do more with data, together.
Nicolay Gerold added
Patterns when telling stories with data: - What is the dataset? Who generated the dataset and why? - What is the process that underpins the dataset? Given that process, what is missing from the dataset or has been poorly measured? Could other datasets have been generated, and if so, how different could they have been to the one that we have? - What
... See moreJohann Van Tonder added
Jilber Najem and added
Where data teams go deeper, faster | Observable
observablehq.comcássius carvalho and added
I went to a ClickHouse meetup here in Dubai.
Never used the tool, but I'm happy to learn.
ClickHouse is:
- a database
- built for analytics
- open source
Furthermore, it:
- can be run on a single node or in a cluster
- stores data in columnar format
- uses both "vectorized query execution" and "runtime code generation" to maximize CPU usage
In the meetup, Cl... See more
Never used the tool, but I'm happy to learn.
ClickHouse is:
- a database
- built for analytics
- open source
Furthermore, it:
- can be run on a single node or in a cluster
- stores data in columnar format
- uses both "vectorized query execution" and "runtime code generation" to maximize CPU usage
In the meetup, Cl... See more
Feed | LinkedIn
Ultimate Repo for Data Teams
notion.castordoc.comJilber Najem added