Spark

/Spark

Configuring YARN Capacity Scheduler Queues in AWS EMR

By | 2017-12-01T16:01:50+00:00 November 2nd, 2017|Categories: AWS, Big Data Performance, EMR, Hadoop, Scheduler, Spark|

Introduction AWS EMR clusters by default are configured with a single capacity scheduler queue and can run a single job at any given time. This blog talks about how you can create and configure multiple capacity scheduler queues in YARN Capacity Scheduler during the creation of a new EMR cluster or when updating existing EMR clusters. [...]

MityLytics for IoT Performance and Scalability Testing

By | 2017-10-26T12:37:23+00:00 October 23rd, 2017|Categories: Big Data Performance, Cassandra, Hadoop, IoT, Kafka, Spark, Storm|

In speaking with folks running IoT stacks, which typically span several streaming and NoSQL technologies such as Kafka, Spark, Storm and Cassandra, it became apparent that they often struggle with understanding how an existing setup scales and performs with varying workloads. So to address this, we enhanced our performance testing module to now offer IoT sensor [...]