About the Simba Apache Spark ODBC Connector

The Simba Apache Spark ODBC Connector provides direct SQL and HiveQL access to Apache Hadoop / Spark distributions, enabling Business Intelligence (BI), analytics, and reporting on Hadoop-based data. The connector transforms an application’s SQL queries, including joins, into the equivalent form in HiveQL, which is a subset of SQL-92. If an application is Spark-aware, you can configure the connector to pass the query through to the database for processing instead. The connector also interrogates Spark to obtain schema information, which it presents to the SQL-based application. For more information about the differences between HiveQL and SQL, see SQL Connector for HiveQL.
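As a rough illustration of the choice between translation and pass-through, the following Python sketch builds a DSN-less ODBC connection string and toggles a pass-through option. It is a minimal sketch under stated assumptions, not a definitive configuration: the driver name "Simba Spark ODBC Driver", the keys Host, Port, and UseNativeQuery, the host spark.example.com, and port 10000 are all assumptions here and should be checked against the configuration options documented for your connector version. The helper names build_connection_string and connect are hypothetical, and pyodbc is a third-party package that is only needed if you actually open a connection.

```python
def build_connection_string(host, port=10000, use_native_query=False,
                            driver="Simba Spark ODBC Driver"):
    """Assemble a DSN-less ODBC connection string.

    All key names and the default driver name are assumptions; verify them
    against the configuration options for your connector version.
    """
    options = {
        "Driver": driver,
        "Host": host,
        "Port": str(port),
        # Assumed semantics: 1 passes queries through unchanged (for
        # Spark-aware applications), 0 lets the connector translate
        # SQL to HiveQL.
        "UseNativeQuery": "1" if use_native_query else "0",
    }
    return ";".join(f"{key}={value}" for key, value in options.items())


def connect(conn_str):
    """Open a connection through the ODBC driver manager using pyodbc."""
    import pyodbc  # third-party; imported lazily so the builder works without it
    return pyodbc.connect(conn_str, autocommit=True)


# Connection string for a Spark-aware application (pass-through enabled).
# The host name is a placeholder.
passthrough = build_connection_string("spark.example.com", use_native_query=True)

# Connection string for a generic SQL application (translation enabled).
translated = build_connection_string("spark.example.com")
```

Keeping the string builder separate from the connect call makes it possible to inspect or log the settings before any driver is loaded. A real deployment would also need the authentication settings that your server requires, which are omitted here.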

The Simba Apache Spark ODBC Connector complies with the ODBC 3.80 data standard and adds important functionality such as Unicode and 32- and 64-bit support for high-performance computing environments.

ODBC is one of the most established and widely supported APIs for connecting to and working with databases. At the heart of the technology is the ODBC connector, which connects an application to the database. For more information about ODBC, see: https://insightsoftware.com/blog/what-is-odbc/. For complete information about the ODBC specification, see the ODBC API Reference from the Microsoft documentation: https://docs.microsoft.com/en-us/sql/odbc/reference/syntax/odbc-api-reference.

The Simba Apache Spark ODBC Connector is available for Microsoft® Windows®, Linux, and macOS platforms.

Note:

The AIX and Solaris connectors are not available through the Simba website. To get these connectors, contact the Sales & Solutions team:

Note:

This is the most up-to-date version of the Installation and Configuration Guide, for use with version 2.8.2 of the connector. If you are using an older version of the connector, certain features may not be available and certain settings may behave in unexpected ways. Please consult the PDF version of the Installation and Configuration Guide that was installed with your connector.

The Installation and Configuration Guide is suitable for users who are looking to access data residing within Hadoop from their desktop environment. Application developers might also find the information helpful. Refer to your application for details on connecting via ODBC.

Note:

For basic configuration instructions that allow you to quickly set up the Windows connector so that you can evaluate and use it, see the Simba ODBC Connectors Quick Start Guide for Windows. The Quick Start Guide also explains how to use the connector in various applications.

  • Features
  • Installing the Driver on Windows
  • Installing the Driver
  • Installing the Connector
  • Configuring the Driver
  • Driver Configuration Options