Apache Impala Adds ODBC Scalar Functions
Written by Kay Ewbank   
Thursday, 17 May 2018

Apache Impala has been updated to version 3, with new features including the addition of ODBC scalar functions that were missing from the previous release. Impala is an open source, native analytic database for Apache Hadoop that provides a high-performance distributed SQL engine.

Impala was originally developed by Cloudera, and donated to the Apache Software Foundation along with Apache Kudu. It can be used to run SQL queries on data stored in HDFS, HBase, Apache Kudu, Amazon S3, and Microsoft ADLS without requiring data movement or transformation. Impala integrates with Hadoop, using the same ODBC driver, file and data formats, metadata, security, and resource management frameworks as MapReduce, Apache Hive, Apache Pig, and other Hadoop software. Impala performs well in analytics, and can interchange data with other Hadoop components as both a consumer and a producer.

Impala keeps its table definitions in a standard MySQL or PostgreSQL database known as the metastore, in the same way that Apache Hive does. This means Impala can access tables defined or loaded by Hive. It also offers low latency and high concurrency for BI and analytic queries on Hadoop, something that batch frameworks such as Apache Hive don't provide.

The new version has a number of improvements. Among the new features is a set of scalar functions, including Left, Right, Week, Quarter, and MonthName, that Impala now supports directly, removing the need for the ODBC driver to translate them.
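To give a sense of what these functions compute, here is a minimal Python sketch of the behaviour usually associated with the scalar functions named above. It is not Impala's implementation: the helper names are made up for illustration, and the ISO 8601 week numbering used for Week is an assumption, since week-numbering conventions differ between systems.

```python
import calendar
from datetime import date


def left(s: str, n: int) -> str:
    """Leftmost n characters of s (empty string when n <= 0)."""
    return s[:max(n, 0)]


def right(s: str, n: int) -> str:
    """Rightmost n characters of s (empty string when n <= 0)."""
    return s[-n:] if n > 0 else ""


def week(d: date) -> int:
    """Week of the year. ASSUMPTION: ISO 8601 numbering (weeks start on Monday)."""
    return d.isocalendar()[1]


def quarter(d: date) -> int:
    """Quarter of the year, 1 to 4."""
    return (d.month - 1) // 3 + 1


def month_name(d: date) -> str:
    """Full month name, in English under Python's default locale."""
    return calendar.month_name[d.month]


if __name__ == "__main__":
    published = date(2018, 5, 17)
    print(left("impala", 3))      # imp
    print(right("impala", 2))     # la
    print(week(published))        # 20
    print(quarter(published))     # 2
    print(month_name(published))  # May
```

When a database engine lacks such functions, the ODBC driver has to rewrite calls to them into expressions the engine does understand. With native support, the query text can be passed through unchanged.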

Support has also been added for Insert plan hints in CREATE TABLE AS SELECT (CTAS) statements. These hints were already supported for Insert statements. The improvement helps tune ETL processes by making CTAS statements as efficient as the equivalent Create + Insert + hints sequence.

More Information

Impala Website

Related Articles

Apache Kudu Improves Web Interface

Hadoop SQL Query Engine Launched

Cloudera Impala Real Time Query On Hadoop 

Apache Arrow Adds Streaming Binary Format 

HBase Adds MultiWAL Support

Apache Kafka Adds New Streams API

Apache Beam Moves To Top Level

Spark BI Gets Fine Grain Security

 


Last Updated ( Wednesday, 16 May 2018 )