CDAP Blog

A Look at Automating Cluster Creation in the Cloud with Coopr

davidb

Coopr is a cluster provisioning system designed to facilitate cluster lifecycle management in public and private clouds. In this post, we take an inside look at what happens when Coopr provisions a cluster. Deploying clusters can be time-consuming. For many system deployments, this work can be accomplished with a configuration management tool such …


Join us for the new Big Data Application Meetup

alexb

Cask is determined to help big data application developers on their journey of building and deploying Hadoop solutions. We’re happy to announce a new meetup for the developer community: the Big Data Application Meetup, a group for everyone interested in building applications using Apache Hadoop™ and other open-source big data technologies. Meetup topics will be focused on …


Deploying CDAP packages from source via Coopr

chrisg

Developing features for CDAP follows a workflow similar to that of many other projects. Developers have their local checkout of the source, make modifications in a feature branch, build and test locally on their development machines, push their branch, and submit a pull request for code review. During this process, developers build CDAP clusters (for testing) …


How we built it: Making Hadoop data exploration easier with Ad-hoc SQL Queries

Please note: Continuuity is now known as Cask, and Continuuity Reactor is now known as the Cask Data Application Platform (CDAP). We are excited to introduce a new feature added in the latest 2.3 release of Continuuity Reactor – ad-hoc querying of Datasets. Datasets are high-level abstractions over common data patterns. Reactor Datasets provide a …


Running Presto over Apache Twill

alvin

Please note: Continuuity is now known as Cask, and Continuuity Reactor is now known as the Cask Data Application Platform (CDAP). We open-sourced Apache Twill with the goal of enabling developers to easily harness the power of YARN using a simple programming framework and reusable components for building distributed applications. Twill hides the complexity of …