Reltio Multi-Tenant Master Data Management DataStax Architecture Review

Reltio is a cloud-based MDM solution that unifies and cleanses multisource, complex data such as CRM contacts and leads into a single source of trusted information.

Reltio chose DataStax Enterprise Cassandra as the primary data store for its cloud-native, metadata-driven model and operations because of its high-performance wide-column storage engine, fault tolerance, linear scalability, and built-in multi-datacenter replication.

DataStax brought RhinoSource into Reltio to conduct a full review of Reltio’s DataStax Enterprise deployment on Amazon Web Services (AWS), including the AWS architecture, Cassandra workloads, data model, code, and DataStax cluster maintenance operations.

RhinoSource worked closely with the Reltio DevOps team to assemble a DataStax Enterprise operations runbook that ensured best practices were followed. RhinoSource also developed a statistical model demonstrating that, for large clusters, the number of Cassandra vNodes per node (the num_tokens parameter) that maximizes cluster availability while still providing some level of load balancing is in the 2-8 range.
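
The trade-off behind the num_tokens finding can be illustrated with a small simulation. This is a minimal sketch and not the statistical model RhinoSource built, which is not described here. It assumes random token assignment, a single datacenter with SimpleStrategy-style replica placement, a replication factor of 3 (so losing any two replicas of a range loses quorum), and a hypothetical 60-node cluster. All function names are illustrative.

```python
import random
from itertools import combinations

RF = 3  # assumed replication factor; with QUORUM, losing 2 of 3 replicas is an outage


def build_ring(num_nodes, num_tokens, rng):
    """Give every node num_tokens random tokens and return the ring sorted by token."""
    ring = [(rng.random(), node)
            for node in range(num_nodes)
            for _ in range(num_tokens)]
    ring.sort()
    return ring


def replica_sets(ring, rf=RF):
    """For each token range, walk clockwise and collect the first rf distinct nodes."""
    n = len(ring)
    result = []
    for i in range(n):
        replicas = []
        j = i
        while len(replicas) < rf:
            node = ring[j % n][1]
            if node not in replicas:
                replicas.append(node)
            j += 1
        result.append(replicas)
    return result


def risky_pair_fraction(num_nodes, num_tokens, seed=0):
    """Fraction of node pairs that share at least one token range.

    If both nodes of such a pair are down at once, that range loses quorum,
    so a higher fraction means a random double failure is more likely to hurt.
    """
    ring = build_ring(num_nodes, num_tokens, random.Random(seed))
    risky = set()
    for replicas in replica_sets(ring):
        risky.update(combinations(sorted(replicas), 2))
    total_pairs = num_nodes * (num_nodes - 1) // 2
    return len(risky) / total_pairs


def ownership_imbalance(num_nodes, num_tokens, seed=0):
    """Largest primary ownership share divided by the mean share (1.0 is perfectly even)."""
    ring = build_ring(num_nodes, num_tokens, random.Random(seed))
    owned = [0.0] * num_nodes
    prev = ring[-1][0] - 1.0  # wrap around from the last token to the first
    for token, node in ring:
        owned[node] += token - prev
        prev = token
    return max(owned) * num_nodes


if __name__ == "__main__":
    for v in (1, 4, 8, 16, 256):
        print(v,
              round(risky_pair_fraction(60, v), 3),
              round(ownership_imbalance(60, v), 2))
```

Under these assumptions, raising num_tokens evens out ownership but also increases the share of node pairs whose simultaneous failure makes some data unavailable, which is the tension a small num_tokens value is meant to balance.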

RhinoSource assisted with load and stress testing to determine the best Cassandra data model design and storage engine configuration. It also made a number of recommendations, including the use of Solr Search indexes, to improve performance and reduce the amount of data stored in Cassandra, and therefore the total number of cluster nodes required.

Finally, RhinoSource assisted with the redesign of a scalable multi-tenant data model and developed an automated data migration and single-tenant backup/restore process using Spark Scala jobs.

Products Serviced

  • DataStax Enterprise Cassandra

  • DataStax Enterprise Solr Search

  • DataStax Enterprise Spark

  • Scala Programming Language

  • DataStax OpsCenter

  • Amazon Web Services (AWS)

Services Performed

  • System Architecture and Configuration Review

  • System Health Check and Performance Audit

  • Data Model Review

  • Multi-Tenant Data Model Redesign

  • Spark Scala Data Migration Job Development

  • Performance Improvement and Scalability Recommendations

  • Troubleshooting and Issue Resolution

  • Performance Load & Stress Testing

  • System Operations Review and Runbook Preparation

  • Solr Index Design

  • Zero Downtime Data Migration

  • Proactive Monitoring and Alerting Recommendations

  • DevOps Scripting and Automation
