Cern Open Lab Summer Student
CERN
Fri Jun 01 2018 - Sat Aug 18 2018
- Presto Query Engine
- Scaling a big data framework
- Query testing CERN production data for use of performance analysis
- Automated bash scripting to automate executions of queries and gathering of results
- Using TPC-DS for benchmarking Apache Spark, Impala and Presto
- Configuration of spilling on presto and Impala Cluster
- Increased performance for the current Impala production set up at CERN