how to make spark work in jupyter notebook
how to make spark work in jupyter notebook
export PYTHONPATH=$SPARK_HOME/python/:$PYTHONPATH
export PYTHONPATH=$SPARK_HOME/python/lib/py4j-0.10.3-src.zip:$PYTHONPATH
to connect spark to mongo db setup mongo connector:
git clone https://github.com/mongodb/mongo-spark.git
./sbt check
./sbt +publish-signed
add this variables before running it in notebook
export PACKAGES="org.mongodb.spark:mongo-spark-connector_2.11:2.0.0"
export PYSPARK_SUBMIT_ARGS="--packages ${PACKAGES} pyspark-shell"