Product

One integrated platform. Everything your data team needs.

Every module runs on the same Apache Spark + Kubernetes foundation. Same catalog, same RBAC, same operational surface, so switching modules isn't switching contexts.

Architecture

From cloud infrastructure to BI. One stack.

Layered on a battle-tested compute foundation, Spark on Kubernetes, with auto-scaling, multi-cluster, and cloud-agnostic deployment.

Business Intelligence

DashboardsChart WidgetsScheduled ReportsSwagger-backed APIs

Intelligence

ML StudioForecasterAnomaly DetectorSherlock (RCA)ProcBot

Delivery

Data Insider (REST)AnalyticsExternal system delivery

Processing

Transformer Module (Visual Canvas)JupyterContainerised Spark pipelines

Ingestion

X→Y (batch/on-demand)CDC (log, query, trigger)Advance ETL · 2000+ connectors

Governance

Data CatalogDataOpsSMART frameworkPlatform Admin & RBAC

Sources

RelationalNoSQLCloud storageKafkaRESTSaaSFiles

↑ Apache Spark · Kubernetes auto-scaling · Multi-cluster · Cloud-agnostic (AWS · Azure · GCP · Private) ↑

The modules

Grouped by what they do for you.

Each module is fully production-grade on its own. They share a data model, a security model, and a catalog, so teams stop paying the integration tax between them.

Ingestion

Data Ingester

Container-native pipeline engine with three modes: X→Y (batch/on-demand), Change Data Capture (log, query, and trigger-based), and Advance ETL over 2000+ connectors.

BatchCDCAdvance ETLContainer-native

Processing

Transformer Module

Visual no-code canvas for Spark transformation pipelines, plus a Jupyter notebook surface. Batch, streaming, and on-demand, the same engine, auto-scaled on Kubernetes.

Visual canvasSparkStreaming

Intelligence

ML Studio

AutoML, visual pipelines, and Jupyter in one place. One-click model deployment as versioned REST APIs. Drift detection and experiment tracking built in.

AutoMLDriftREST

Forecaster

25+ time-series algorithms: ARIMA, SARIMA, Prophet, LightGBM, XGBoost, N-BEATS, and more. Scheduled runs, accuracy dashboards, and side-by-side backtests.

Time-seriesML modelScheduled

Anomaly Detector

Near-real-time anomaly detection on SQL, Kafka, webhooks, and APIs. Continuous re-learning with separate workspaces for monitoring and model configuration.

Near-real-timeKafkaContinuous learning

Operations

Sherlock

Autonomous root-cause analysis. No-code decision trees execute on Spark, isolate failure points, and verify remediation with a closed-loop health check.

RCAAuto-remediationNo-code

ProcBot

No-code workflow automation. Event or schedule-driven; conditional logic, action blocks, and first-class integration with external systems.

WorkflowEvent-drivenNo-code

Delivery

Data Insider

Expose any SQL or NoSQL source as a versioned, secured REST API. Row and column-level security, rate limiting, auth, and auto-generated Swagger included.

Data-as-APIGatewayVersioning

Analytics

Dashboards, chart widgets, and tabular reports with a visual query builder. Scheduled delivery via email, SFTP, or API. RBAC-governed access end to end.

DashboardsReportsScheduled

Governance

Data Catalog

Cross-tool lineage, automated discovery, business glossary, classification, and automated PII tagging. One catalog for every module.

LineagePIIGlossaryDiscovery

DataOps

Operational command centre: live pipeline health, SLA tracking, failure-pattern analysis, queue and resource trends, and AI-generated recommendations.

Command centreSLAAI insights

Platform Admin

Unified RBAC, user and role management, data-source administration, and BI enablement. One security model for every module.

RBACAdminGovernance

Agentic AI layer

Agents across platform.

A growing library of agents ships with the platform, regardless of which modules you turn on. No separate contract, no separate model to manage.

Talk to Your Data

Plain-English queries over SQL, NoSQL, S3, Cassandra, and APIs.

ETL/ELT Designer

Describe the requirement, and the agent ships a deployment-ready pipeline.

Spark Summarizer

Turns verbose Spark logs into "what ran, failed, was slow, fix this."

ProcBot Designer

Describe a process, and the agent generates the working script in bash, Python, Terraform, or Ansible.

Sherlock

Autonomous agents help with not only problem discovery but throughout the process from problem detection to auto-remediation and closure.

DataOps AI

Ask about pipeline health and SLAs; answers come from live telemetry.

AI Governance & Intelligence

AI agents continuously enrich metadata, classify sensitive data, monitor compliance, and generate governance insights across enterprise data assets.

Data Exploration AI

Describe the requirements in natural language, and the agent generates the transformation code behind the scenes to produce the output.

Integrations

Connects to where your data already lives.

Two thousand plus connectors across six categories, delivered through the Advance ETL engine. Drag-and-drop by default; custom code when the source demands it.

Databases & warehouses

PostgreSQL
MySQL
Oracle
SQL Server
Snowflake
BigQuery
Redshift
MongoDB
Cassandra

Cloud storage

AWS S3
Azure Blob
GCS
Azure Data Lake
HDFS
MinIO

Streaming & messaging

Apache Kafka
AWS Kinesis
Azure Event Hubs
RabbitMQ
Webhooks
REST APIs

SaaS & applications

Salesforce
SAP
ServiceNow
Workday
HubSpot
Zendesk
Jira
+ 280 more

BI & reporting

Power BI
Tableau
Looker
Excel
SFTP export
Email delivery

File formats

CSV / JSON / XML
Parquet
Avro
ORC
FTP / SFTP

See it running on your stack.

Thirty-minute walkthrough. Your data, your connectors, real pipelines. No slideware.

Get in touch Explore the product