Your team is building a data engineering and data science development environment.
The environment must support the following requirements:
✑ support Python and Scala
✑ compose data storage, movement, and processing services into automated data pipelines
✑ use the same tool to orchestrate both data engineering and data science
✑ support workload isolation and interactive workloads
✑ enable scaling across a cluster of machines
You need to create the environment.
What should you do?