I have machine learning code in two files: a .py (Python) script and an .ipynb (notebook).
Both files are stored locally. The data is queried using sqlContext,
because it is stored in the cloud (on a Cloudera cluster).
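The question is: how can I run the .py/.ipynb files (which exist locally) on the cluster, so that they can access the data stored on the cluster and produce the required results?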