Q: What is MapR Event Store?
Q: What is an event stream?
Q: What is a publish-subscribe model?
Q: How is MapR Event Store related to the MapR Data Platform?
Q: What are the industry challenges for event streams?
Q: How does MapR Event Store address these challenges?
Q: What are the key features of MapR Event Store?
Q: Who will benefit from using MapR Event Store?
Q: What is the relationship between MapR Event Store and other “streaming” components like Spark, Storm, Apex, and Flink?
Q: How does MapR Event Store compare with Kafka?
Q: What are some of the functional use cases for MapR Event Store?
Q: What are some vertical use cases that can benefit from MapR Event Store?
Q: How can I try MapR Event Store?
Q: How can I buy MapR Event Store?
Q: How do I use MapR Event Store if I already have MapR?
Q: What version of MapR do I need to use MapR Event Store?
MapR Event Store is a global publish-subscribe event streaming system for big data. It connects data producers and consumers worldwide in real time, with unlimited scale. Publishers (data producers) write data to one or more topics in MapR Event Store. Subscribers (data consumers) to the topic can read the data instantaneously, anywhere across the globe.
MapR Event Store is unique for two reasons. First, it is the first big data-scale streaming system to be built into a converged data platform. Next, it is the only big data streaming system to support global event replication at Internet-of-Things (IoT) scale and reliability, providing failover endpoints between up to thousands of distributed clusters.
An event stream is a continuous flow of event data that is transported between multiple applications or services. The events are typically generated by diverse data sources including web applications, system logs, social media, sensors, connected devices, and machine logs. In contrast to the types of events that legacy message queue-oriented systems were designed to handle, big data event streams are often generated by millions of sources worldwide, reaching millions or sometimes billions of events per second.
Publish-subscribe is a messaging paradigm where the data producers (referred to as publishers) do not directly send the data to data consumers. Instead, they publish the data to a system that manages “topics.” The data consumers (referred to as subscribers) subscribe to relevant topics to retrieve the data. This model allows publishers and subscribers to publish and subscribe without knowledge of each other, at different rates.
MapR Event Store is an integral part of the MapR Converged Data Platform, which also includes file, storage, database services, and processing frameworks in a single cluster. Furthermore, batch, interactive, and stream processing frameworks have direct access to event streams, eliminating data movement and ensuring consistency. It derives enterprise features such as secure access control, encryption, multi-tenancy, and strong consistency from the MapR Converged Data Platform.
Data volume and diversity: Modern businesses are being overwhelmed by the onslaught of data created continuously by diverse sources such as web applications, social media, sensors, connected devices, and machine logs, to name a few.
Geographic dispersion: To add to the complexity, the diverse sources mentioned above are often geographically distributed, sending data to the closest data center for low latency. This distributed data needs to be centralized and joined with data from enterprise applications to paint a complete picture of the state of business.
Delayed processing and insights: Although the data is created continuously, it is consumed for transformation, movement, or processing at a predetermined frequency. This introduces data pipeline complexity and precludes the ability to respond immediately to new information.
Architectural complexity: Businesses typically deploy data transport systems and data processing systems in separate clusters. This creates complexity in analyzing new data available in the data streams in real time, as well as administrative overhead of managing separate clusters.
MapR Event Store provides a reliable, globally scalable streaming system that connects data producers and consumers via topics. MapR Event Store is integrated into one converged data platform with file, database, and stream processing services.
Converged data platform reduces architectural complexity for streaming: MapR Event Store brings together data transport and data processing in the same cluster. Batch, interactive, and stream processing frameworks have direct access to event streams, eliminating data movement and ensuring consistency. Like other services in the MapR Converged Data Platform, MapR Event Store provides enterprise features such as secure access control, encryption, and multi-tenancy.
Continuous real-time data processing avoids delayed processing and insights: MapR Event Store makes real-time data directly available for processing. Real-time data can be processed by stream processing frameworks such as Spark Streaming to enable sub-second response and automated actions. Enterprise features such as high availability with no single point of failure and disaster recovery mirroring ensure that your system is always on for business critical environments.
Global scalability handles data diversity and geographic dispersion: MapR Event Store scales linearly as nodes are added, allowing billions of events per second to be sent across billions of topics. Further, MapR Event Store is designed for geographically dispersed systems, with real-time global replication. You can access data created at multiple geographical locations, and process it in real time to get a complete state-of-the-business picture. Producers and consumers can failover between distributed clusters for high availability.
Business leaders: Improve responsiveness to critical events with continuous processing of real-time big data.
Enterprise architects/lead engineers: Simplify the flow of data across data sources, formats, and locations, to reduce architectural complexity and TCO.
Developers: Improve time-to-market for advanced data streaming applications to meet the growing demands of your stakeholders.
MapR Event Store provides the reliable data ingestion, transport, and buffering for stream processing frameworks such as Spark, Storm, Apex and Flink. These stream processing frameworks are fully integrated with MapR Event Store, and work together to enable real-time global streaming analytics.
MapR Event Store is similar to Kafka, as both systems use the same API for publish and subscribe. What differentiates MapR Event Store is its proven enterprise features such as global replication, security and multi-tenancy, and HA/DR, all of which it inherits from the MapR Converged Data Platform.
Stream processing: MapR Event Store provides the ingest, transport, and buffering layer for stream processing frameworks such as Spark Streaming to enable real-time operations such as calculations and aggregations on data as it’s delivered.
Database change capture: Change capture keeps the operational system-of-record synchronized with other systems.
Application logs and metrics delivery: MapR Event Store can provide a pipeline for log/metrics data coming out of appliances, servers, and applications, making them available to infrastructure monitoring systems for alerting, dashboarding, and search.
When MapR 5.1 becomes available in early 2016, MapR will provide MapR Event Store as part of the free MapR Converged Community Edition. We will also release a virtual machine sandbox with MapR Event Store along with tutorials, sample code, and video demos to make getting started easy.
You can license MapR Event Store as an individual product or bundle it with our other enterprise products. Please reach out to firstname.lastname@example.org to learn more.
You can license MapR Event Store as an add-on to your current MapR Distribution. Please reach out to email@example.com to learn more.
5.1 and greater.