DuckDB 1.0 Released
Written by Kay Ewbank   
Tuesday, 04 June 2024

DuckDB 1.0 has been released. It follows the successful 0.10 release in February, which introduced both forward and backward compatibility for DuckDB's storage format.

DuckDB is a relational database that has already gained popularity with millions of downloads a month. Download traffic for just the DuckDB extensions is running at upwards of four terabytes a day.


The developers describe DuckDB's advantages as simplicity, portability, speed, feature richness, and the fact that it's free. They say the simplicity comes from easy installation and embedded, in-process operation. DuckDB has no external dependencies, either at compile time or at runtime. It doesn't run as a separate process, but is completely embedded within a host process. This allows high-speed data transfer to and from the database for analytical uses.

The lack of dependencies ensures portability. DuckDB can be compiled for all major operating systems, and will run on machines ranging from small, resource-constrained edge devices to large multi-terabyte memory servers with 100+ CPU cores. Using DuckDB-Wasm, DuckDB can also run in web browsers and even on mobile phones.

DuckDB provides APIs for Java, C, C++, Go, Node.js, Julia, Python, R, Rust, Swift, ODBC and Wasm. DuckDB is deeply integrated into Python and R for efficient interactive data analysis.

In terms of features, DuckDB offers support for complex queries in SQL with a large function library and window functions. It provides transactional guarantees (ACID properties), and data can be stored in persistent, single-file databases. DuckDB supports secondary indexes to speed up queries trying to find a single table entry.

DuckDB's speed is another attraction. It is designed to support analytical query workloads, which tend to have complex, relatively long-running queries that process significant portions of the stored dataset. To support this, DuckDB contains a columnar-vectorized query execution engine, where queries are still interpreted, but a large batch of values is processed in one operation.
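To make the idea of vectorized interpretation concrete, here is a conceptual sketch in plain Python. It is not DuckDB's engine; the operator names and the batch size are assumptions chosen for illustration. Each operator is called once per batch of column values rather than once per row, so the interpretation overhead is spread across the whole batch.

```python
from array import array

VECTOR_SIZE = 2048  # illustrative batch size chosen for this sketch


def scan(column, vector_size=VECTOR_SIZE):
    """Yield a single column in fixed-size batches (vectors)."""
    for start in range(0, len(column), vector_size):
        yield column[start:start + vector_size]


def filter_greater(batches, threshold):
    """Filter operator: handles one whole batch per call."""
    for batch in batches:
        yield array(batch.typecode, (v for v in batch if v > threshold))


def sum_all(batches):
    """Aggregate operator: adds up each batch with one call to sum()."""
    total = 0
    for batch in batches:
        total += sum(batch)
    return total


# A column of 10,000 integers stored contiguously, as in a columnar layout.
prices = array("q", range(10_000))

# Roughly: SELECT sum(price) FROM t WHERE price > 4999
total = sum_all(filter_greater(scan(prices), 4_999))
batch_count = len(list(scan(prices)))
print(total, batch_count)
```

With 10,000 values and a batch size of 2,048, the operators run five times each instead of 10,000 times.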

DuckDB also offers a flexible extension mechanism that allows defining new data types, functions, file formats and new SQL syntax. The developers say that many of DuckDB's key features, such as support for the Parquet file format, JSON, time zones, and the HTTP(S) and S3 protocols, are implemented as extensions.

DuckDB is open source, and the entire source code is freely available on GitHub. The team is holding its next community event, DuckCon #5, in Seattle on August 15, and attendance is free.


More Information

DuckDB Website

Related Articles

Ibis 8 Adds Streaming

Apache Arrow 5 Improves Asynchronous Scanner


Last Updated ( Tuesday, 04 June 2024 )