pySpark becomes a popular choice for many data engineers and becomes the mainstream technology for big data and machine learning projects. pySpark is not a programming language. It is a wrapper (or abstraction) to help python developers to write spark code. Apache Spark is developed in Scala (a functional Programming Language) and to make Apache Spark more accessible to the … read the rest
While using Chef for Windows, there are multiple backends for the Windows feature resource—DISM and servermanagercmd. Each one has a specific Ruby class that will be used based on the determined backend as follows:
- Chef::Provider::WindowsFeature::DISM: This uses DISM to manage roles/features (default unless DISM is not present)
- Chef::Provider::WindowsFeature::ServerManagerCmd: This uses Server Manager to manage roles/features (the fallback
Similar to how Linux distributions have package management tools and a repository of packages, Windows has long had built-in packages that come with the OS. Both desktop and server releases of Windows have installable components out of the box, with servers having more than desktops.
In Windows parlance, roles are similar to Chef’s notion of roles—a collection of software packages … read the rest
When managing Windows with Chef, there are some Windows-specific resources that are available to you as part of the Windows stack. This section covers those resources that are specific to Windows such as the Windows Registry, roles, MSIs, and so on; the ones that won’t be available on Linux systems.
Working with Windows-specific resources
Most systems administrators, managing Windows means … read the rest
This guide helps you in downloading and installing apache sqoop. Apache Sqoop supports the Linux operating system, and there are several installation options. One option is the source tarball that is provided with every release. This tarball contains only the source code of the project. You can’t use it directly and will need to first compile the sources into binary … read the rest
The official Apache Hadoop 3.0 Download was made available Dec 2017. The Hadoop 3.0 is a feature packed release with lots of new feature and enhancements. Since Hadoop 3.0 is not yet available with Cloudera CDH 6.x or Hortonworks HDP 3.x and you have to installed the basic Hadoop 3.0.3 from it official website.
Hadoop 3.0 Download
Download … read the rest
Hadoop 3.0 Hortonworks is an obvious question after you have seen Hadoop 3.0 new feature and enhancement list. At the time of writing this blog, HDP was having 2.6.4 supporting Hadoop 2.7.3. Hadoop 3.0 has lot of changes and if you want to try it in stand alone mode before it becomes available, it is available for installation. … read the rest
OCJP Practice Papers – Java Class Design include following topics
- Implement encapsulation
- Implement inheritance including visibility modifiers and composition
- Implement polymorphism
- Override hashCode, equals, and toString methods from Object class
- Create and use singleton classes and immutable classes
- Develop code that uses static keyword on initialize blocks, variables, methods, and classes
See the complete syllabus for OCJP here