Month: July 2020

Analyze COVID-19 Dataset with Databricks | Databricks Unified Analytics Platform

In this article, we will analyze COVID-19 Dataset using Databricks unified analytics platform using the community edition of the platform, which is totally for free and you can use it as your playground to test Apache Spark applications in Python or R depends on your favorite API of development. Dataset will be used in this …

Analyze COVID-19 Dataset with Databricks | Databricks Unified Analytics Platform Read More »

Detailed Guide for String Wrangling in SQL | MySQL | SQL Analysis

Extracting information from string columns is almost a repetitive necessity in Data Engineers, Data Scientists, and Business Analysts day to day tasks, and this task can be done using a programming language such as Python, or by SQL depends on your application and on the task required. In this tutorial, we will discover together how …

Detailed Guide for String Wrangling in SQL | MySQL | SQL Analysis Read More »

Build Data Analysis and Data Discovery Web Application for Data Science projects in few minutes | Data Science | Data Analytics

Data preparation and data discovery consume a great amount of time in any data science or data analytics job, one of the solutions is to write a template script that you can use in this phase of your job, but what about adding interactive controls and dynamic controls into your scripting wouldn’t that be great?, …

Build Data Analysis and Data Discovery Web Application for Data Science projects in few minutes | Data Science | Data Analytics Read More »

So Which Machine Learning Algorithm to use?

A lot of data science practitioners found the process of selecting a machine learning algorithm overwhelming and confusing. That’s because there are a bunch of algorithms that can do the same task. For example, classification can be done using a Decision Tree, SVM, Logistic Regression, Naive Bayes, KNN, and Neural Network.  Now, which one should …

So Which Machine Learning Algorithm to use? Read More »

Data Science Roadmap .. Concepts, Tools, and Technologies

In this article, we will depict some skills and concepts that must be learned in the journey of becoming a data scientist but first, what is data science?  Data Science is the art of uncovering the insights and trends in data. It has been around since ancient times. The ancient Egyptians used census data to …

Data Science Roadmap .. Concepts, Tools, and Technologies Read More »

Setup Talend Open Studio on Linux

Introduction Talend is an open-source data integration platform. It provides different solutions and services for data integration, data quality, cloud storage, and Big Data. According to the latest Gartner report, Talend named in the leader’s quadrant among other data integration solutions. In this article, we will show you step by step to install and configure Talend …

Setup Talend Open Studio on Linux Read More »