Are you an aspiring data engineer, who doesn’t know where to start? Don’t worry, I will walk you through this process. Which will give you a clear picture of the process.
1. Don’t Get Distracted
Yes As you are already reading this article it simply means that you have decided to become a big data engineer. Right? If Yes, keep Reading.
Don’t get distracted by other technology, there are tons of Technology available in software engineering, you can choose any of them. If you keep on switching your learning path in between, You are not going to become an expert in one.
And once you are an expert in one, later you can improve your skills to become a master.
So Focus on one Big Data Component and try to learn the use of this in real-life projects and slowly grasp the coding skills. And you can keep on adding more big data components in your learning path.
2. Get a Good Laptop
Get a laptop with at least 12-16GB RAM and a good processor. It might be a bit more costly than a normal laptop, but trust me it is an investment with higher ROI.
And install Linux based OS, and install major big data components one by one. And set up a complete standalone cluster on your laptop.
Tip: If you have a laptop with 8GB RAM, Still you can try.
3. Choose one Language
To become a good big data engineer, You have two choices :
- Python
- Scala
You can choose any one language of your choice and start practicing syntax and basic programs, But Don’t wait to Learn one language completely and then move to the data engineering path. Rather Learn both of them simultaneously.
4. Learn the Basics
Now You have the language to communicate and a laptop for Big data engineering. You can start and get yourself indulged in the Pile of Data.
Learn Hadoop and its major components like Hive, HBase, and Hdfs. and how parallel processing works. Once you have the idea of it, you can learn Apache spark and start loading small files and do basic transformation and analytics on it. Remember we are learning Big data, But you don’t actually need very big files for learning. Prepare your own data and try to visualize the output data and convert the requirement into code.
Once you have crossed the path of a beginner, you can choose new components like Apache Kafka, elastic search, and keep on learning them one by one. And trust me at some point you will realize the need for components. Like when you need to process the real-time data, you will be searching apache Kafka and so on.
You can refer to this website for learning, we have tutorials for big data.
5. Showcase Your skill
Do some minor projects on your own and upload the code to the GitHub or wherever you can show your skills. It will build your confidence in showing something.
Tip: I am not talking about simple word count or hello world programs here.
6 .Time of ROI
[ Return of Investment ]
It is up to you, it may be landing a job at a good company or making a new product.
And don’t forget to keep on learning other new big data components to become a master.
Yes, you can subscribe to us and all other big data programmers and increase your reach in the world. and don’t forget to share this post, if you share You might help other aspiring big data engineers to achieve their goal.
Tip: You can contact us for any help in your big data journey. if we can, we will do it for sure.
You can ask for help here.
Important Links to refer: