Real-Time Analytics Platform

With our “Real-Time Analytics” platform, we offer you a solution that allows you to receive, process and analyze your data as streams in real time. Together with AWS’s IoT solution, our platform can cover a majority of streams of sensor data, services , social media, and so on.

In the platform, we have integrated leading technologies such as Mesos, Kafka, Cassandra and Spark streaming, which are also aligned with each other and build on industry standards such as Java. This also gives you support and development security and allows you to benefit from new developments.

Since we implement the platform in your “private cloud”, you have complete control over your data and high flexibility over your processes. Yet you still enjoy the benefits of a cloud-based solution such as low investment costs, high scalability, maximum mobility and consumption-based fees.

Components

Cassandra is one of the leading distributed and highly available No-SQL databases used by well-known companies such as Cisco, Credit Suisse, Disney, Ebay, Hp and many more. is in operation.

Cassandra is known for high availability and high throughput characteristics, and it is capable of handling enormous write loads and surviving cluster node failures. With respect to the CAP theorem, Cassandra provides configurable consistency and availability for operations.

In terms of data processing, Cassandra is linearly scalable (increased loads can be met by increasing the number of nodes in a cluster) and it is capable of cross-data center replication (XDCR). XDCR offers a number of interesting use cases for:

  • geo-distributed data centers: data specific to the region or closer to the customer.
  • Data center data migration: recovering from outages or moving data to a new datacenter.
  • Separate operational and analytics workloads: Separate clusters can be set up for write-intensive and analytics-intensive applications.

Cassandra is subject to the Apache 2.0 license.

Real-Time Processing

Streaming data is data that is generated continuously and from a variety of data sources. The data recordings are usually sent simultaneously and in small packets (kilobyte range). This data must be processed sequentially and incrementally on a per-record basis or in sliding time windows. Examples of streaming data are

  • Sensors from industrial equipment and machines send data to a real-time application to monitor production.
  • A financial company monitors financial transactions in real time for anomalies and fraud.
  • Evaluation of stock market performance and share performance using extra real-time data.
  • Online store evaluates visitor activity and the number of clicks in real time.
  • A real estate mobile app sends users suggestions of potential properties to view nearby, based on their location.

Unlike batch processing, the latency required for stream processing is in the range of seconds or milliseconds. This places special demands on both the processing and storage of the data. In addition, the system must be fail-safe and scalable.

Our real-time analytics platform is designed to meet the requirements and ensure that streaming data is processed with the required latency. With Kafka, Spark and Cassandra, we are building on technologies that have already proven themselves in many demanding real-time applications.

This post is also available in: German