Latest Updates & Insights

The Theseus Blog

Discover the latest insights, tutorials, and updates about
Composable Data Systems, Theseus, SQL, and AI-powered data
processing.

Latest Posts

Top 5 Challenges in Large-Scale Data Pipelines — And How GPU-Accelerated Analytics Unlocks Next-Level Innovation

Voltron Data
Blog

Modern Data Governance and Security with Voltron Data's Theseus: Meeting the Requirements of EO-14028 and OMB M-21-31

Steven Morrow
Blog

Relentlessly Improving the Performance of our GPU Query Engine, Theseus

Voltron Data

Go Inside Arrow Database Connectivity: Roadmap, Background & Community

Matt Topol
Srikanth Nadukudy

Adding A New Ibis-cuDF Backend With Zero Code Changes

Marlene Mhangami
Nick Becker

Fast, Elegant, and Performant Geospatial Data Analysis with Arrow

Kae Suarez
Dewey Dunnington
and 2 more

ADBC Brings Composability to Industry Leading Data Tools, Stacks

Kae Suarez

GPUs for Analytics: An Experiment with Tuning, Chunking, Compression & Decompression

Joost Hoozemans
Kae Suarez

Hugging Face Embeddings & Ibis: Create a Custom Search Engine for Your Data

Marlene Mhangami

The Standard Dataframe Language for Data Analysis and Data Engineering

Marlene Mhangami
Fernanda Foertter

nanoarrow: A Lightweight, Embeddable Arrow Implementation for Data Pipelines

Kae Suarez
Dewey Dunnington

Leverage Arrow and Ibis to Streamline Database Connectivity

Kae Suarez
Phillip Cloud

Dataframe Interoperability in Python: How PyArrow Enables Modular Workflows

François Michonneau
Alenka Frim

Pass Data Between Python and R using Parquet & Arrow for Scalable Reporting

François Michonneau

Use LLMs with Python UDFs to Query & Generate Tabular Data in Natural Language

Cody Peterson
Marlene Mhangami

Ibis and Snowflake at the Speed of Arrow

Kae Suarez
Phillip Cloud

Zero-Copy Sharing using Apache Arrow and Golang

Matthew Topol

Showing the Power of Parquet and Ibis Using UK Census Data

Kae Suarez

How Polars Leverages Rust and Arrow For Faster Data Pipelines

Marlene Mhangami

Use LangChain & Ibis to Chat with Data Stored Anywhere

Marlene Mhangami

Ibis 6.0 Preview: Supercharge Oracle Workflows with the Power of Python

Kae Suarez

Shopping for a Data Warehouse? Lower Costs Using Ibis to Benchmark Queries

François Michonneau

Data Transfer with Apache Arrow and Golang

Matthew Topol

How to Connect & Analyze Data using Ibis in Snowflake

Fernanda Foertter
Kae Suarez

Give Your MySQL, MS SQL, or PostgreSQL Stack an Upgrade with Ibis

Kae Suarez

How Google Uses Ibis for its Data Validation Tool (DVT)

Kae Suarez

Make pandas Faster with DuckDB

Phillip Cloud
Kae Suarez

Comparing Performance of ADBC and JDBC in Python for Arrow Flight SQL

Philip Moore
Kae Suarez

Scale From Local to Distributed Cloud Compute with Ibis, PySpark, and Amazon EMR

Philip Moore

Make Data Files Easier to Work With Using Golang and Apache Arrow

Matthew Topol
Felipe Aramburu
ArrowGo

Explore a New Way to Deploy Data Storage and Analytics with Arrow Flight SQL and Apache Superset

Kae Suarez

Ibis 5.1: Faster file reading with DuckDB, Arrow-Native Workflows for Snowflake, and more

Kae Suarez
Anja Boskovic

Scaling Out to Apache Spark with Ibis

Jordan Volz

When Scale Matters, Don't Wait on pandas...

Kae Suarez

Breaking Down the First Principles of Ibis

Kae Suarez

New Ibis Backend Shipped in 4 hours… Hello, Druid!

Phillip Cloud

What is Substrait? A High-Level Primer

Kae Suarez

Learn How Ibis Solves Your @Problems (Video)

Keith Britt

Speeds and Feeds: Hardware and Software Matter

Keith Britt

Ibis: Upgraded Interface, Same Stack

Kae Suarez

Use Apache Arrow and Go for Your Data Workflows

Matt Topol

How to Use Snowflake and Ibis for Better Analytics

Marlene Mhangami

New Release: Ibis 5.0 Has Landed

Patrick Clarke

SQL and Data Frames Unite with Ibis

Marlene Mhangami

b.telligent Makes Intelligent Use of Ibis

Keith Britt

Shopping for a Data Warehouse? Put Workloads to the Test with Ibis

Kae Suarez

Ibis 5.0 Preview: Three Features to Get Excited About

Kae Suarez
Phillip Cloud
and 1 more

From Laptop to Cloud: Ibis Connects With Your Data at Any Scale

Kae Suarez

The Top Python Tools to Analyze PUMS Census Data

Marlene Mhangami

Scaling Down: The Python Libraries You Need to Compress and Analyze the PUMS Dataset

Kae Suarez
Marlene Mhangami

Ibis: Easy, Performant, and Portable Python API for Data Analytics

Fernanda Foertter

Running an Arrow Flight SQL Server and Querying Data with JDBC and ADBC

Philip Moore
Tom Drabas
and 1 more

383 Ibis Expressions and the Only Language You Need is One

Keith Britt
Phillip Cloud
and 1 more

Ibis and Substrait: Supercharging Portability

Kae Suarez

Quick Wins: Shortening Time to Analysis with Ibis 4.1

Kae Suarez

Making Big Data Feel Small: Analysis of Hacker News Stories with BigQuery and Ibis (Part 1)

Marlene Mhangami

Arrow Database Connectivity: Apache Arrow for Every Database User

David Li

Apache Arrow Flight SQL: Arrow for Every Database Developer

David Li

Inside Ibis: Contributors Weigh In Ahead of the 4.0 Release

Voltron Data

Ibis Explained: Increasing Code Portability and Performance Gains

Patrick Clarke

The Takeaway: Go and Apache Arrow at ApacheCon ’22

Matt Topol

Ibis Explained: Making DataFrames, Big and Small, More Delightful

Patrick Clarke
Alison Hill

Quick Wins: Accelerating pandas CSV Reading with Apache Arrow

Kae Suarez

Data Transfer Between Python and R with rpy2 and Apache Arrow

Danielle Navarro

Ibis v3.2.0 Brings More Ways to Tackle Tabular Data

Marlene Mhangami

Passing Arrow Data Between R and Python with Reticulate

Danielle Navarro

Creating an Arrow Dataset

François Michonneau

One Function. No Rewrites. Explore Ibis for Filling Null Values.

Patrick Clarke

Improving Apache Arrow One Change at a Time

Sam Albers
Stephanie Hazlitt

Quick Wins: Reading CSVs in R with Apache Arrow

Keith Britt

Getting Started with Apache Arrow in R

Danielle Navarro
Jonathan Keane
and 1 more

Simplifying database connectivity with Arrow Flight SQL and ADBC

David Li
Tom Drabas
and 1 more

Apache Arrow Version 9.0.0 Released

Alessandro Molina
Ian Cook

Ibis v3.1.0 Release Brings New Features and Updates

Patrick Clarke

Introducing Arrow Flight SQL: The All-Star Database Connector

Tom Drabas
David Li

Serving Dataframes Over the Wire with Arrow Flight SQL and DuckDB

Tom Drabas
David Li
and 2 more

Data Transfer at the Speed of Flight

Tom Drabas
Fernanda Foertter
and 1 more

Engine Agnostic Analytics with Ibis

Patrick Clarke

Arrow 8.0.0 Release Brings New Functionality for PyArrow, Arrow Flight, C++ Engine, and More

Alessandro Molina
Will Jones

Apache Arrow New Contributor’s Guide

Alenka Frim

Introducing Substrait: An Interoperable Data to Engine Connector

Phillip Cloud
Jacques Nadeau
and 3 more

Apache Arrow Version 7.0.0 Released

Alessandro Molina
Ian Cook

Apache Arrow: Driving Columnar Analytics Performance and Connectivity

Wes McKinney

Apache Arrow 7.0.0 – What to Expect

Alessandro Molina
Ian Cook