Best way to Run Spark -


there group project working on , want utilize spark. however, not know best way run on our computers. had thought maybe hortonworks, , suggested looking @ maven. aren't sure though.

we students can't buy (at least expensive).

when search ways run spark on computer (windows), pops compiling techniques or code help.

you have lots of options:

  1. download source github or apache , run locally. readme has instructions or can download learning spark , read chapter 2.
  2. download cloudera distribution hadoop 5 quickstart virtual machine. requires virtual machine player vmware or virtualbox (make sure vt-x enabled in bios). runs spark in pseudo-distributed stand-alone mode , allows run inside yarn container configuration changes. can run spark locally out-of-the-box dependencies installed.
  3. download hortonworks virtual machine. works similar cloudera's offering not familiar it.

i recommend cloudera if machine capable of running heavy-weight vm , want try running in pseudo-distributed environment. if want learn how program using api, try first option.


Popular posts from this blog