All Stories

PySpark Jupyter Notebook Configuration On Windows

PySpark can be installed on Windows in two different ways. Although Spark is a distributed compute engine, it also works standalone. Most developers who are familiar with...

Apache Spark 3.0 Release Note (Preview)

Apache Spark 3.0 has been released and is available for testing in preview mode. The release was made on 2019-Nov-08 and was announced via Twitter. The preview mode is launched to...

Snowflake SnowPro Practice Test

Snowflake SnowPro Core Certification

Snowflake Container Hierarchy Practice Test

Snowflake SnowPro Core Certification

Snowflake Architecture Practice Test

Snowflake SnowPro Core Certification

PySpark FAQ

About PySpark Local Installation: Can I install PySpark on Windows 10? Yes, PySpark can be installed on Windows 10 or even earlier versions; refer to the complete guide here. General...

What is new in Spark 3.0

This write-up covers Apache Spark 3.0 features and improvements. Apache Spark 3.0 is a major release and is currently available in preview mode. Release 3.0.0 is a major release...

Snowflake Data Warehouse Glossary

The Snowflake Data Warehouse Glossary covers all the important keywords that a developer must know. The Snowflake cloud data platform is cloud-native, faster, easier to use, and far more flexible than...

Snowflake Data Warehouse Best Practices

Snowflake is a cloud-native, easy-to-use virtual data warehouse system. Since it is built on top of a cloud-native platform, traditional best practices are no longer applicable....

Compare Unix Kernel Shells

This short article discusses and compares UNIX kernel shells, which confuse many technical folks. The Unix operating system used a shell program called the Bourne Shell. Then, slowly, many...

Check Unix OS Version Using Putty

Many times you get access to a Unix or Linux box via a terminal. Before you start using the terminal, you may want to know the Unix flavor. This article will help...

Hadoop 3.0 Vs Spark 2.X

Many Spark users who use Hadoop as the storage layer under Spark computation are asking whether Hadoop 3.0 and Spark 2.x are compatible. Spark 2.2.1 was released the 1st...

Hadoop 3.0 Vs Hadoop 2.0

Hadoop 3.0 vs Hadoop 2.0: Hadoop 3.0.0 GA (General Availability) was released on 13-Dec-2017. Everybody wants to know what it brings to the table for developers, administrators, and enterprise...

Hadoop 3.0 Security By Ben And Joey

Hadoop 3.0 Security by Ben and Joey: Protecting Your Big Data Platform is an excellent, practical, well-written book that describes Apache Hadoop and the numerous security features within Apache...

Hadoop 3.0 Roadmap

Hadoop 3.0 Roadmap: The latest version of Hadoop is already out and everybody is excited about its features. Hadoop 3.0 is a major release after Hadoop 2.9, which was released in...