Hadoop Infrastructure Engineering
Automated flexible builds of Hadoop clusters using Ansible playbooks. Built separate big data MapR clusters by workload type (batch, real-time, BI, ad hoc, etc.) supporting 100+ application teams.
Leveraged Password Vault for secret management and GitHub for source code & configuration management.
Handled cluster design, layout, administration and performance tuning of cluster services, as well as upgrades, patching and monitoring.
Data Security & Governance
Created a robust data security authorization framework for the Big Data MapR & Cloudera platforms, which host 7+ petabytes of production data covering 15+ data domains.
Created centralized access control, streamlined data policy management, and consistent data sharing and compliance for the enterprise data lake platform using Privacera, MapR ACE, Ranger KMS, Atlas & HSM.
Business Intelligence
Built a true self-service business intelligence platform for 10+ lines of business using AtScale. It allows 300+ business users to directly access, analyze and share insights across the organization through live connections to 10 TB of big data and RDBMS sources via a unified semantic layer, regardless of where the data is stored.
Dremio SQL Acceleration
A Better BI Experience
Ad-hoc, mission-critical BI – and everything in between – directly on cloud data lake storage, without copying the data into warehouses, marts, extracts or cubes.
Data analysts and data scientists are empowered to discover, curate, analyze, and share datasets with a self-service mindset. Users can build interactive dashboards through native Dremio connectors in tools such as Tableau and Power BI.
Complex Hadoop Migrations & Uplifts
Migrated 50+ complex application teams from RDBMS to Hadoop and from Hadoop MapR 5.2 to 6.1. The migration included building separate clusters by workload type (batch, real-time, BI, ad hoc, etc.) and involved tools & services such as Hive, Spark, HBase and MapReduce.
Created migration plans, deployment architectures, and security authentication & authorization designs covering data, metadata, job and user migration.
Created and validated detailed test cases. Identified capacity needs and allocated resources via queues.
Real-time Processing Framework
Built a real-time processing cluster for Fraud & Authentication services, enabling them to make real-time fraud decisions, detect and prevent fraud, and perform fraud analytics.
Procured 27 nodes and built a MapR real-time cluster handling 3.2 billion messages per month. Installed MapR core services, including Spark and Drill clients.
Created MapR-DB JSON tables and secondary indexes. Set up a business continuity platform and replication.
Helped application teams deploy microservices on top of the MapR-DB database.
Enterprise Data Catalog
Built an Enterprise Data Catalog using Lumada Data Catalog, enabling metadata discovery, data lineage visualization, identification and isolation of sensitive data, and data profiling.