Set Up Talend Open Studio on Linux

Introduction

Talend is an open-source data integration platform. It provides solutions and services for data integration, data quality, cloud storage, and Big Data. According to the latest Gartner report, Talend was named in the Leaders quadrant among data integration solutions. In this article, we will show you step by step how to install and configure Talend Open Studio on a Linux operating system. The article also covers installing Java, which is a prerequisite for Talend.

In the next article, we will demonstrate how to use Talend's ETL capabilities to implement one of the critical types of Data Warehouse table, the Slowly Changing Dimension table. Before that, this article makes sure Talend is installed and configured correctly so that it is ready for the coming use cases.

Talend On Linux

Make sure that you have the following prerequisites before starting the installation:

  • Java
  • Talend Open Studio installer files.

Java Setup

Java is the main prerequisite for installing Talend. Install the Java package for your environment as follows (the commands below use apt and install the OpenJDK JDK, which includes the JRE):

For JDK version 8

$ sudo apt install openjdk-8-jdk

For JDK version 11

$ sudo apt install openjdk-11-jdk 
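After either install, it can help to confirm which Java launcher is actually on the PATH. The snippet below is a minimal sketch of such a check. The function name report_java is made up for illustration and is not part of Java or Talend; the only assumption is that the launcher is called java.

```shell
# Report whether a Java launcher is available on the PATH.
# report_java is a hypothetical helper name used only for this example.
report_java() {
    cmd="${1:-java}"
    if command -v "$cmd" >/dev/null 2>&1; then
        echo "found: $(command -v "$cmd")"
        # java prints its version to stderr, so merge it into stdout
        "$cmd" -version 2>&1 | head -n 1
    else
        echo "not found: $cmd"
        return 1
    fi
}

report_java || echo "Install a JDK first"
```

If the launcher is found, the second line of output shows the version that will be picked up by default.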

Talend Setup

In the following steps, we will download the Talend installer files and walk through how to start Talend Open Studio.

Check Java installation

Run the following command to check whether Java is installed and configured properly in your environment:

$ echo $JAVA_HOME

If the output of the previous command is empty, follow the next steps to add an environment variable that points to your Java installation.

1- Open the ~/.bashrc file using vi, nano, or any other text editor available in your environment

$ nano ~/.bashrc

2- Add the Java environment variable using the following format

export JAVA_HOME=<directory where java is installed>
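If you are unsure where Java is installed, the directory can usually be derived from the resolved path of the java launcher. The following is a sketch that assumes a layout where the launcher lives at <Java home>/bin/java. The helper name java_home_from_bin is illustrative only. On some JDK 8 packages the launcher sits under a jre subdirectory, in which case the result points at that jre directory rather than the JDK root.

```shell
# Strip the trailing /bin/java from a launcher path to get the Java home.
# java_home_from_bin is a hypothetical helper name used only for this example.
java_home_from_bin() {
    printf '%s\n' "${1%/bin/java}"
}

if command -v java >/dev/null 2>&1; then
    # Resolve symlinks such as /usr/bin/java to the real launcher path
    java_bin="$(readlink -f "$(command -v java)")"
    # Print the line to add to ~/.bashrc instead of editing the file directly
    echo "export JAVA_HOME=$(java_home_from_bin "$java_bin")"
else
    echo "java not found on PATH"
fi
```

The sketch only prints the export line so that you can review it before adding it to ~/.bashrc. After saving the file, run source ~/.bashrc or open a new terminal so that echo $JAVA_HOME shows the new value.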

Run Talend Open Studio

After extracting the Talend zip file we just downloaded, open a terminal in the directory where the files were extracted and make the launcher script executable:

$ chmod +x TOS_DI-linux-gtk-x86.sh

Now start Talend Open Studio by executing the script TOS_DI-linux-gtk-x86.sh from the command terminal

$ ./TOS_DI-linux-gtk-x86.sh
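If the launch command fails with "No such file or directory" or "Permission denied", a quick check of the script can narrow down the cause. This is a minimal sketch; launcher_status is a hypothetical helper name, and it assumes the script name used in this article.

```shell
# Report whether the Talend launcher script is present and executable.
# launcher_status is a hypothetical helper name used only for this example.
launcher_status() {
    script="${1:-./TOS_DI-linux-gtk-x86.sh}"
    if [ ! -f "$script" ]; then
        echo "missing"
    elif [ ! -x "$script" ]; then
        echo "not executable"
    else
        echo "ready"
    fi
}

launcher_status
```

A result of "missing" usually means the terminal is not in the extraction directory, while "not executable" means the chmod step still needs to be run.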

Don’t worry about any failure messages at this stage. They appear because some third-party libraries could not be installed.

As a first-time user, you need to set up a new project, or you can import the demo project. On subsequent startups, you will see a list of existing projects to select from.

  • Talend Studio requires specific third-party Java libraries or database drivers (.jar files) to be installed in order to connect to sources and targets. These libraries or drivers, known as external modules, can be required by some Talend components, by some connection wizards, or by both. Due to license restrictions, Talend may not be able to ship certain external modules within Talend Studio.
  • Choose the feature to install: Third-party libraries (43)
  • Click Finish
  • Accept all the licenses of the selected packages

Sayed Ibrahem

Sayed Ibrahem is a passionate data engineer whose experience focuses on building modern data pipelines using big data technologies such as Apache Spark and Apache Hive, and cloud solutions such as Google Cloud Platform's big data and analytics services, as well as using different data integration and data streaming technologies to build complete data pipelines.
