Month: April 2020

Dimension Keys – Part 1 – Natural Keys | Data Warehouse

Dimensions tables are core part of any Data Warehouse modeling. In general dimension tables store details side of any event or business process, for example, for a purchase operation from a retail store we will have dimension tables to store customer information, product information, store information, and so on, on the other hand, Fact tables …

Dimension Keys – Part 1 – Natural Keys | Data Warehouse Read More »

Functions in Scala – Part 1 | Scala

Scala is a  a multi-paradigm language that supports both functional and object-oriented programming with a growing community and many useful features Scala worth learning, and it has been adopted by big enterprises such as Linkedin , Twitter, and many others. As Functional programming is one of the main strength points in Scala and and understanding …

Functions in Scala – Part 1 | Scala Read More »

Migrate Files from local files system to Amazon S3 with Python Application | AWS S3 | Python

Amazon Simple Storage Service (Amazon S3) is an object storage service that offers industry-leading scalability, data availability, security, and performance. S3 storage well fit in different use cases, such as websites, mobile applications, backup and restore, archiv2e, enterprise applications, IoT devices, and big data analytics. Amazon S3 provides easy-to-use management features so you can organize …

Migrate Files from local files system to Amazon S3 with Python Application | AWS S3 | Python Read More »

Create Scala Project on Intellij with Scala Worksheets | Scala

Scala is a  a multi-paradigm language that supports both functional and object-oriented programming with a growing community and many useful features Scala worth learning, and it has been adopted by big enterprises such as Linkedin , Twitter, and many others. When you start experimenting Scala you can use Scala interactive REPL (Read Evaluate Print Loop) …

Create Scala Project on Intellij with Scala Worksheets | Scala Read More »

Setup Apache Spark environment on Windows | Apache Spark

Apache Spark is easy to use, unified platform for all purposes of big data processing, and equipped with rich set of APIs for different application needs as Spark DataFrame and Spark SQL for structured data processing, Spark Streaming and Structured Streaming for streaming applications, Spark MLib for machine learning applications, Spark Graphx for Graph analytics …

Setup Apache Spark environment on Windows | Apache Spark Read More »

Apache Spark Application Execution Mode | Apache Spark

Apache Spark is a powerful processing platform for big data applications that supports different big data processing types. In this article we will discover together how Apache Spark application can be executed in multiple modes, depends on the environment architecture and on the application requirements. Before going into details, if you would like to setup …

Apache Spark Application Execution Mode | Apache Spark Read More »