Reltio Multi-Tenant Master Data Management DataStax Architecture Review
Reltio is a cloud-based MDM solution that unifies and cleanses multisource, complex data such as CRM contacts and leads into a single source of trusted information.
Reltio chose DataStax Enterprise Cassandra as the primary data store for its Cloud-native metadata-driven model and operations, due to its high performance columnar storage engine, fault tolerance, linear scalability and built-in multi-datacenter replication capability.
DataStax brought RhinoSource into Reltio to conduct a full review of Reltio’s DataStax Enterprise deployment on Amazon Web Services (AWS), including the Amazon AWS architecture, Cassandra workloads, data model, code, and DataStax cluster maintenance operations.
RhinoSource worked closely with the Reltio DevOps team and assembled the DataStax Enterprise operations runbook to ensure best practices were followed, as well as developed a statistical model that demonstrated that the optimum number of Cassandra vNodes (num_tokens parameter) to maximize cluster availability of large clusters, while still providing some level of load balancing, is in the 2-8 range.
RhinoSource assisted with load & stress testing to determine the best Cassandra data model design and storage engine configuration and made a number of recommendations to improve performance and reduce the amount of Cassandra data stored and therefore the total number of cluster nodes required, including the use of Solr Search indexes.
Finally, RhinoSource assisted with the redesign of a scalable multi-tenant data model and developed an automated data migration and single-tenant backup/restore process using Spark Scala jobs.
Products Serviced
DataStax Enterprise Cassandra
DataStax Enterprise Solr Search
DataStax Enterprise Spark
Scala Programming Language
DataStax OpsCenter
Amazon Web Services (AWS)
Services Performed
System Architecture and Configuration Review
System Health Check and Performance Audit
Data Model Review
Multi-Tenant Data Model Redesign
Spark Scala Data Migration Job Development
Performance Improvement and Scalability Recommendations
Troubleshooting and Issue Resolution
Performance Load & Stress Testing
System Operations Review and Runbook Preparation
Solr Index Design
Zero Downtime Data Migration
Proactive Monitoring and Alerting Recommendations
DevOps Scripting and Automation
More information on Reltio’s Cassandra use case: