Someone asked me that is my favourite ETL tech. Without giving any second thought, I answered that it is Databricks.But why?
Someone asked me that is my favourite ETL tech.
Without giving any second thought, I answered that it is Databricks.
But the next big question is why?
Let me answer this question in clear cut point wise manner:-
1)easy to set up the environment without thinking much about the on premise hardware requirement.
2)multilanguage support without doing any installation.
2)different variety of cluster that it supports. It has clusters with GPU as well
2)clusTer autoscaling capability which scales up and scales down based on the load.
3)Unity catalog which provides centralized access control, auditing, lineage, and data discovery capabilities
4)capability of databricks to implement ML algorithms use cases over it.
5)its capability to enable multiple users to write code in the same notebook.
6)its SQL warehouse capability
7)Delta tables which is game changer in the Big Dat Industry.
8)Autoloader which can easily detect changes from few sources and process the data.
9)its availability over all the major cloud platform in the market like AZURE,AWS,GCP.
DO you also think that databricks is the best ETL tool in the market right now? Do comment if you have any other thoughts or any other point which you would like to add.
My session contains a very in depth discussion around databricks.
My next batch on Azure Data Engineering will be starting on 20th July.DM me or call us on 7870970617 for enrollment.
Do subscribe to my youtube channel and get more contents about data engineering.