Cloudberry is a general-purpose system that layers distributed middleware on top of Apache AsterixDB, a big data management system, to support efficient real-time analytics and visualization on very large data sets.

Key Features

  • Semi-structured data model
  • Indexing support for spatial, temporal, and textual attributes
  • Sub-second response for expensive OLAP queries
  • Real-time analytics on fast data
  • Feed adapters for ingesting continuous data

Architecture

Demo: TwitterMap

This live prototype supports interactive analytics and visualization on more than one billion tweets, with new data ingested continuously. Check out this video.