Driver Setup

To set up the Simba Spark ODBC Driver to connect to your data store, do the following:

  • Install the driver on the machine that you use to access the data. For more information, see Installing the Driver.
  • Configure the driver by creating a Data Source Name (DSN) that specifies the necessary connection properties. For more information, see Configuring the Driver.

Installing the Driver

On 64-bit Windows operating systems, you can execute 32- and 64-bit applications transparently. You must use the version of the driver matching the bitness of the client application accessing data in Hadoop / Spark:

  • SimbaSparkODBC32.msi for 32-bit applications
  • SimbaSparkODBC64.msi for 64-bit applications

You can install both versions of the driver on the same machine.

You install the Simba Spark ODBC Driver on client machines that access data stored in a Hadoop cluster with the Spark service installed and running. Each machine that you install the driver on must meet the following minimum system requirements:

  • One of the following operating systems:
    • Windows Vista, 7, 8, or 10
    • Windows Server 2008 or later
  • 100 MB of available disk space

Important:

To install the driver, you must have Administrator privileges on the machine.

To install the Simba Spark ODBC Driver:

  1. Depending on the bitness of your client application, double-click to run SimbaSparkODBC32.msi or SimbaSparkODBC64.msi.
  2. Click Next.
  3. Select the check box to accept the terms of the License Agreement if you agree, and then click Next.
  4. To change the installation location, click Change, then browse to the desired folder, and then click OK. To accept the installation location, click Next.
  5. Click Install.
  6. When the installation completes, click Finish.
  7. If you received a license file via e-mail, then copy the license file into the \lib subfolder in the installation folder you selected above.
  8. Note:

    To avoid security issues, you might need to save the license file on your local machine prior to copying the file into the \lib subfolder.

Configuring the Driver

To use the Simba Spark ODBC Driver in a client application, you need to configure a Data Source Name (DSN) to connect to your data store.

To create a Data Source Name:

  1. Open the ODBC Administrator:
    • If you are using Windows 7 or earlier, click Start Windows Start button, then click All Programs, then click the Simba Spark ODBC Driver program group corresponding to the bitness of the client application accessing data in Hadoop / Spark, and then click ODBC Administrator.
    • Or, if you are using Windows 8 or later, on the Start screen, type ODBC administrator, and then click the ODBC Administrator search result corresponding to the bitness of the client application accessing data in Hadoop or Spark.
  2. Click the System DSN tab.
  3. In the list of DSNs, select Sample Simba Spark DSN, and then click Configure.
  4. In the Host field, type the IP address or host name of your Spark server.
  5. Note:

    If you are running Spark on your local machine, then type localhost in the Host field.

  6. In the Port field, type the number of the TCP port that the Spark server uses to listen for client requests.
  7. Note:

    By default, Spark uses port 10000.

  8. In the Database field, type the name of the database that you want to access.
  9. From the Spark Server Type drop-down list, select the server type used by your data store.
  10. If your Spark server is configured to use authentication, then select the appropriate authentication mechanism from the Mechanism drop-down list and provide the necessary credentials and authentication settings in the fields below the list.
  11. Note:

    For detailed information about configuring authentication, see the Simba Spark ODBC Driver Installation and Configuration Guide.

  12. To test the connection to your Spark data store, make sure that Spark is running, and then click Test in the Simba Spark ODBC Driver DSN Setup dialog box.
  13. The Test Results dialog box appears, displaying a message informing you whether the connection succeeded or failed.

  14. Click OK to close the Test Results dialog box, and then click OK to save your DSN and close the Simba Spark ODBC Driver DSN Setup dialog box.

Note:

For more information about configuring the Simba Spark ODBC Driver, see the Simba Spark ODBC Driver Installation and Configuration Guide.