docs/content/overview/what-is-rerun.md
Rerun covers the whole journey from raw recordings to training, on a single unified data layer for multi-rate, multimodal robotics data.
It's comprised of Rerun SDK: an open source library and tools for logging, storing, querying, visualizing, and training on multi-rate, multimodal data; and Rerun Hub: a data catalog and backend for large scale storage, access, and streaming of robotics data from object storage.
Building intelligent physical systems requires rapid iteration on both data and models. But teams often get stuck because:
The best robotics teams minimize their time from new data to training. Rerun gives you the unified infrastructure to make that happen.
Rerun is built for teams developing intelligent physical systems:
If you're working with robots, drones, autonomous vehicles, spatial AI, or any system with data that evolves over time, Rerun helps you move faster.
Use the logging API to log multimodal data from your code, or the chunk processing API to convert your existing data to the .rrd file format to later visualize or query.
<div class="d2-diagram"> </div>Rerun provides an open source pre-built viewer that is adjustable and extensible. You can log directly to the viewer, open a range of file formats to get data into the viewer, or even connect the viewer to a Rerun catalog.
<div class="d2-diagram"> </div>The Rerun file format supports both high performance visualization and querying over the same data source.
You can use the open source catalog server for running local laptop scale examples. We also offer Rerun Hub, a scalable catalog for robotic data, for teams that need collaborative dataset management, version control, and cloud storage (reach out to learn more). These are API compatible so the only difference from our examples to Rerun Hub is that you connect to an existing server instead of launching your own.
Before querying or viewing recordings on the catalog we have to register them. We group recordings as datasets. Since Rerun indexes existing data in place, registration needs paths to RRDs to index: in object store for Rerun Hub or on disk for local catalog server.
<div class="d2-diagram"> </div>At this point a viewer can connect to the prepared catalog or we show the basic steps to perform a query. We specify what dataset we want to query, get access to a lazy loaded dataframe, specify our query, and retrieve the results. Queries can be specified with SQL or dataframe APIs allowing the flexibility to investigate anything about your data.
<div class="d2-diagram"> </div>Use the catalog as a data source for training: a dataloader runs a query against the catalog and yields training batches.
direction: right
query: "query"
dataloader: "dataloader"
catalog: "Rerun Catalog" {
shape: cylinder
height: 130
}
batches: "batches"
query -> dataloader
dataloader -> catalog
batches <- catalog
Ready to speed up your iteration cycle?