How does it compare to 3 years old Joyent's Manta ?
AFAIK it was especially designed for this kind of purposes. The processing is made directly on the servers storing the data..
Manta is pretty similar to Elastic MapReduce, which also runs the computation on the same node as the data. So it compares pretty much the same as EMR.