"Big Data" and "DevOps" technologies are a hot and highly valuable skill to have and our vagrant and Hadoop courses will teach you how to quickly create a distributed environment for you to deploy these technologies on.
A combination of VirtualBox and Vagrant will transform your desktop machine into a virtual cluster. However, this needs to be configured correctly. Simply enabling multi-node within Vagrant is not good enough; it needs to be tuned. Developers and Operators within large enterprises, including investment banks, use Vagrant to simulate Production environments.
After all, if you are developing against or operating a distributed environment, it needs to be tested, both in terms of the code deployed and the deployment code itself.
On your own computer or laptop, you'll learn the same techniques these enterprise guys use.
Vagrant provides easy to configure, reproducible, and portable work environments built on top of industry-standard technology and controlled by a single consistent workflow to help maximize the productivity and flexibility of you and your team.
Our Hadoop courses use VirtualBox to carve out a virtual environment.
I am a huge fan of this instructor
This course is an amazing overview of big-data and the power of being able to understand it, Props to the instructor
Brian Peck (Udemy Cloudera Student)
For those with little or no knowledge of the Hadoop ecosystem. Course : Big Data Intro for IT Administrators, Devs and Consultants
To carve out a distributed environment that models your physical topology, without the risk of corrupting your physical servers. Course : Real World Vagrant For Distributed Computing
To deploy Cloudera Manager plus agents. Agents are the guys that will sit on all the slave nodes ready to deploy your Hadoop services. Course : Real World Vagrant - Automate a Cloudera Manager Build
Once Cloudera is installed you will need to deploy the Hadoop services across your cluster. We look at the logic regarding the placement of master and slave services. Course : Real World Hadoop - Deploying Hadoop with Cloudera Manager
If you want to play around with HDFS commands (Hands on distributed file manipulation). Course : Real World Hadoop - Hands on Enterprise Distributed Storage.
You can also automate the deployment of the Hadoop services via Python (using the Cloudera Manager Python API). This is an advanced step. Ensure you understand how to manually deploy the Hadoop services first. Course : Real World Hadoop - Automating Hadoop install with Python!
There is also the upgrade step. Once you have a running cluster, how do you upgrade to a newer Hadoop release (both for Cloudera Manager and the Hadoop services)? Course : Real World Hadoop - Upgrade Cloudera and Hadoop hands on
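As a rough illustration of the master and slave placement logic mentioned in the deployment step above, here is a minimal Python sketch that puts master roles on the first host and worker (slave) roles on the rest. The role names, the host names and the plan_roles helper are all assumptions made for this example. It does not call the Cloudera Manager API and is not taken from the course material.

```python
from typing import Dict, List

# Hypothetical role names for illustration; a real cluster may use a different set.
MASTER_ROLES = ["NameNode", "ResourceManager"]
WORKER_ROLES = ["DataNode", "NodeManager"]


def plan_roles(hosts: List[str], master_count: int = 1) -> Dict[str, List[str]]:
    """Give master roles to the first `master_count` hosts and worker roles to the rest."""
    if master_count < 1 or master_count >= len(hosts):
        raise ValueError("need at least one master host and one worker host")
    plan: Dict[str, List[str]] = {}
    for index, host in enumerate(hosts):
        roles = MASTER_ROLES if index < master_count else WORKER_ROLES
        plan[host] = list(roles)  # copy so callers can edit one host's roles safely
    return plan


if __name__ == "__main__":
    # Hypothetical three-node virtual cluster.
    for host, roles in plan_roles(["node1", "node2", "node3"]).items():
        print(host, roles)
```

Keeping the plan as plain data makes it easy to review the layout before anything is deployed, whether you then deploy by hand or from a script.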
I was looking for a guide to help me understand and debug the DAG construction by spark along-with how distributed architecture is consumed
This course has been insightful in many way to accomplish the objective
Toyin has a novel way to deliver the concepts
Kedar Gupte (Udemy Spark Student)
This set of courses is targeted at DevOps and enables you to package a complete Spark development environment into your own custom 2.3GB Vagrant box.
Once built you no longer need to manipulate your Windows machine in order to get a fully fledged Spark environment to work. With the final solution, you can boot up a complete Apache Spark environment in under 3 minutes!!
Install any version of Spark you prefer. Once you have configured Spark, further courses look at manipulating Spark from pyspark, spark-shell, sparkIDE and Jupyter.
Why Apache Spark ...
Apache Spark runs programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk.
Apache Spark has an advanced DAG (directed acyclic graph) execution engine that supports acyclic data flow and in-memory computing.
Apache Spark offers over 80 high-level operators that make it easy to build parallel apps. And you can use it interactively from Scala and Python shells.
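To give a feel for what chaining high-level operators means, here is a toy Python sketch of lazy evaluation: each operator only records a step, and nothing runs until collect() is called. The LazyPipeline class is a made-up stand-in for this example and is NOT the Spark API. It also runs on a single machine and only models a straight chain of steps, whereas a real DAG engine can branch, join and distribute the work.

```python
from typing import Any, Callable, Iterable, List, Tuple

Step = Tuple[str, Callable[[Any], Any]]


class LazyPipeline:
    """Toy, single-machine stand-in for a lazily evaluated dataset (not the Spark API)."""

    def __init__(self, data: Iterable[Any], steps: Tuple[Step, ...] = ()):
        self._data = data
        self._steps = steps

    def map(self, fn: Callable[[Any], Any]) -> "LazyPipeline":
        # Record the step only; no element is touched yet.
        return LazyPipeline(self._data, self._steps + (("map", fn),))

    def filter(self, fn: Callable[[Any], bool]) -> "LazyPipeline":
        # Record the step only; no element is touched yet.
        return LazyPipeline(self._data, self._steps + (("filter", fn),))

    def collect(self) -> List[Any]:
        # Only now is the recorded chain applied, one step after another.
        items = iter(self._data)
        for kind, fn in self._steps:
            items = map(fn, items) if kind == "map" else filter(fn, items)
        return list(items)


if __name__ == "__main__":
    squares_of_evens = (
        LazyPipeline(range(10))
        .filter(lambda x: x % 2 == 0)
        .map(lambda x: x * x)
    )
    print(squares_of_evens.collect())  # [0, 4, 16, 36, 64]
```

Because the steps are only recorded, the engine sees the whole chain before running it, which is what gives a real execution engine room to plan the work.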
FinTech for Programmers
These FinTech programmer courses have been built from the ground up. They do not impose any database, server or middleware requirements and are aimed at quantitative or systems developers tasked with delivering analytic solutions into trading, risk, clearing, prime servicing or collateral businesses.
The courses teach the student how to build Capital Market Products along with valuation. Products include Interest Rate Swaps, Bonds, Cap/Floors, Swaptions.
We will have FinTech courses in Java and Scala (for the JVM) and C#, VB.NET, C++/CLI, F# and IronPython (for .NET)
Each course will walk through Capital Market Financial Calculator/Models using Visual Studio for Windows (.NET) or ScalaIDE (Java/Scala)
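To show the kind of valuation involved, here is a minimal Python sketch that prices a fixed-rate bond by discounting its cash flows at a single flat yield. This is the standard textbook formula, not the model used in the courses, and it is written in Python only for brevity. The bond_price function and its parameters are assumptions made for this example, and it ignores day counts, accrued interest and settlement conventions.

```python
def bond_price(face: float, coupon_rate: float, yield_rate: float,
               years: int, frequency: int = 1) -> float:
    """Present value of a fixed-rate bullet bond under one flat yield.

    Simplified sketch: whole coupon periods only, no accrued interest,
    no day-count or business-day conventions.
    """
    periods = years * frequency
    coupon = face * coupon_rate / frequency   # cash paid each period
    rate = yield_rate / frequency             # discount rate per period
    pv_coupons = sum(coupon / (1 + rate) ** t for t in range(1, periods + 1))
    pv_face = face / (1 + rate) ** periods    # principal repaid at maturity
    return pv_coupons + pv_face


if __name__ == "__main__":
    # Hypothetical 10-year bonds with a 5% annual coupon and face value 100.
    print(round(bond_price(100, 0.05, 0.05, 10), 4))  # yield equals coupon
    print(round(bond_price(100, 0.05, 0.06, 10), 4))  # yield above coupon
```

A quick sanity check on any such calculator: when the yield equals the coupon rate the bond prices at face value, and when the yield is higher the price falls below it.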
FinTech for Excel (no VBA required!!)
These FinTech XL courses have been built from the ground up to be flexible. They do not impose any database, server or middleware requirements and are aimed at quantitative or systems developers tasked with delivering analytic solutions into trading, risk, clearing, prime servicing or collateral businesses.
The courses teach the student how to build Capital Market Products along with valuation.
Products include Interest Rate Swaps, Bonds, Cap/Floors and Swaptions.
Each course provides Capital Market Financial Spreadsheet Calculator/Models using Excel for Windows, our Training FinTech Function XLL Addin, Portability (Read/Write to JSON / XML), Student Templates and PDF guides
If you are a developer, these courses will help you isolate dependencies and their configuration within a single disposable, consistent environment, without sacrificing any of the tools you are used to working with (editors, browsers, debuggers, etc.). Once you or someone else creates a single Vagrantfile, you just need to vagrant up and everything is installed and configured for you to work. Other members of your team create their development environments from the same configuration. Say goodbye to "works on my machine" bugs.
If you are an operations engineer, these courses will help you build a disposable environment and consistent workflow for developing and testing infrastructure management scripts. You can quickly test your deployment scripts and more using local virtualization such as VirtualBox or VMware (these courses use VirtualBox). Ditch your custom scripts to recycle EC2 instances, stop juggling SSH prompts to various machines, and start using Vagrant to bring sanity to your life.
If you are a designer, these courses will help you with distributed installation of software in order for you to focus on doing what you do best: design. Once a developer configures Vagrant, you do not need to worry about how to get that software running ever again. No more bothering other developers to help you fix your environment so you can test designs. Just check out the code, vagrant up, and start designing.