THESEUS

THESEUS:The SQL Engine for AI Workloads

GPU-accelerated. Distributed. AI-native. Orchestrate your AI pipelines with blazing fast SQL.

Python

1# Import required libraries
2from theseus.cluster import GatewayWorkerCluster
3import ibis
4
5# Initialize a Theseus cluster with 1 worker
6tc = GatewayWorkerCluster(n_workers=1).__enter__()
7
8# Connect to Theseus using Ibis and the cluster's location
9con = ibis.theseus.connect(tc._location)
10
11# Read a Parquet file into Theseus
12con.read_parquet('example.parquet', table_name='example')
13
14# Execute a SQL query and convert results to pandas DataFrame
15con.sql('select * from example limit 10').to_pandas()

Trusted by Industry Leaders

GPU Accelerated SQL is Fast

[Lower is faster, to the left is cheaper]

Complete Benchmarking ReportBenchmarking Report

By rethinking query execution for GPUs, Theseus parallelizes complex queries and outperforms leading engines, delivering results in seconds.

How it works

0x

Cheaper

0x

Faster

0x

Fewer Nodes

GenAI Meets SQL

Process data and drive LLM workflows on the same GPU. Theseus adds GPU‑powered UDFs to SQL, enabling full pipelines, from embedding and vector search to RAG in Dataframes and SQL.

Check Out RAG UDFs

Your Data.
Your Format.
Your Tables.

Theseus provides fast, in-place queries for data lakes, lakehouses, and warehouses, and supports Apache Iceberg and standard file formats.

Read the EBook

Theseus Control Plane

Powered by a Kubernetes-native control plane, Theseus deploys across public and private clouds using existing authentication and air-gapped support for maximum security and compliance.

Explore the Control Plane

Theseus Query Profiler

Built for heterogeneous, accelerator-native environments, the Theseus Query Profiler monitors the engine at run-time, enabling real-time analysis of query performance at CPU, GPU, and I/O layers with zero overhead.