Best way to run Spark
There is a group project we are working on, and we want to utilize Spark. However, we do not know the best way to run it on our computers. We had thought maybe Hortonworks, and it was suggested that we look at Maven. We aren't sure, though.
We are students, so we can't buy anything (at least nothing expensive).
When I search for ways to run Spark on a computer (Windows), all that pops up is compiling techniques or code help.
You have lots of options:
- Download the source from GitHub or Apache and run it locally. The README has instructions, or you can download Learning Spark and read chapter 2.
- Download the Cloudera Distribution for Hadoop 5 QuickStart virtual machine. It requires a virtual machine player such as VMware or VirtualBox (make sure VT-x is enabled in the BIOS). It runs Spark in pseudo-distributed stand-alone mode and allows you to run it inside a YARN container with some configuration changes. You can also run Spark locally out of the box, with the dependencies already installed.
- Download the Hortonworks virtual machine. It works similarly to Cloudera's offering, but I am not as familiar with it.
I recommend Cloudera if your machine is capable of running a heavy-weight VM and you want to try running in a pseudo-distributed environment. If you just want to learn how to program using the API, try the first option.
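To give a feel for what the first option looks like once Spark is unpacked, here is a minimal sketch of a word count in local mode using PySpark. It assumes PySpark is importable (for example, when the script is launched through `bin/spark-submit` from the Spark directory). The function names `tokenize` and `local_word_count` and the app name are made up for illustration; the `local[2]` master URL just means "two worker threads on this machine", so no VM or cluster is needed.

```python
# Minimal local-mode sketch; the names here are illustrative, not from any tutorial.
# Assumes PySpark is importable, e.g. when run via bin/spark-submit.


def tokenize(line):
    """Split a line into lowercase words, ignoring extra whitespace."""
    return line.lower().split()


def local_word_count(lines, master="local[2]"):
    """Count words in `lines` using Spark on this machine only."""
    # Imported here so tokenize() stays usable without Spark installed.
    from pyspark import SparkConf, SparkContext

    conf = SparkConf().setMaster(master).setAppName("LocalWordCount")
    sc = SparkContext(conf=conf)
    try:
        counts = (
            sc.parallelize(lines)              # distribute the input lines
            .flatMap(tokenize)                 # one record per word
            .map(lambda word: (word, 1))       # pair each word with a count of 1
            .reduceByKey(lambda a, b: a + b)   # sum the counts per word
        )
        return dict(counts.collect())
    finally:
        sc.stop()  # always release the local Spark context
```

Calling `local_word_count(["run spark", "spark locally"])` should map `spark` to 2 and the other two words to 1. The same code should run unchanged against the stand-alone or YARN setups in the VMs by passing a different master URL, which is one reason to start locally and move to a VM later.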