@easewithdata

To Install PySpark in your Local using Docker, follow the below steps (remove square brackets):
1. Run command [docker pull easewithdata/pyspark-jupyter-lab]
3. Run command to run container: [docker run -d -p 8888:8888 -p 4040:4040 --name jupyter-lab easewithdata/pyspark-jupyter-lab]

To setup PySpark Cluster with Jupyter Lab, follow the below instructions:
1. Clone the repo : [https://github.com/subhamkharwal/docker-images]
2. Change to folder > pyspark-cluster-with-jupyter
3. Run the command to create containers: [docker compose up]

Make sure to the Jupyter Lab Old for the cluster executions.
In case of any issue, please leave a comment in with Error message.

@funnyvideo8677

sir idk why ur not reaching and many are not subsribing but whatever ur doing ur doing with passion and whover it helps their home god will bless u thanks

@DataEngineerPratik

I successfully completed a comprehensive PySpark video course that provided a solid understanding of Spark's overall architecture, DataFrame operations, and Spark internals. The course also covered advanced topics, including optimization techniques in Databricks using Delta Tables. Thanks a lot :)

@satyamgour9461

Best PySpark lecture I have ever found.

@Shreekanthsharma-t6x

Absolutely loved this PySpark tutorial! Thank you for such a great resource—looking forward to more content from you!

@ChandraS-j1f

what an amazing youtube channel I found recently while searching to learn Data engineering concepts. you are the most knowledgeable person and best content .Keep rocking brother. we will support you🙌🙌

@ayyappahemanth7134

I am waiting for this single video to come, to go once again. I went through the playlist already. It's excellent🎉

@easewithdata

To setup PySpark Cluster with Jupyter Lab, follow the below instructions:
1. Clone the repo : [https://github.com/subhamkharwal/docker-images]
2. Change to folder > pyspark-cluster-with-jupyter
3. Run the command to build image: [docker compose build]
4. Run the command to create containers: [docker compose up]

In case of any issue, please leave a comment in with Error message.

@NiteshShinde-xt3hs

Sir can you please make apache Airflow tutorial for orchestration

@alexfoster93

one of the best channels i've found as im learning data engineering! would you consider making a video on lakesail's sail? supposedly its 4x faster than Spark, with 90% reduced hardware costs, built on rust. super curious your thoughts!

@syedmugheesbukhari9661

one of the best and point to point explaination

@sanooosai

great sir, its gold mine thank you for sharing your valuable information

@funnyvideo8677

great content practicing again

@joe_coconuts

that's amazing video thank you so much  --- from China

@kaushikjnayak5602

Great video. How will you make sure random salting will not result in join keys not matching at all? Deterministic salting on department_id will not solve the skewing problem either.

@isaacafedzi3368

it is a great video but you have to improve on the sound. it's very hard to hear what you say

@sanskaragrawal8686

Best Video for pyspark 🍀

@yashitshrivastava2260

2:24:17 can you elaborate how to setup standalone spark session and how to access it (Localhost:8080)??

@adarshgupta7152

Bhai jaroor pilauga coffee. but aise nahi. sath me piyege.🤝

@ABQ06

I have been gone through ur channel,having little confusion 
Can u provide detail road map like from where to start?