CDAP Blog


A Hydrator Python Transform for Python nerds like you and me!

John Jackson

Before every CDAP release, we at Cask conduct an internal hackathon to use CDAP and work on interesting features. A few Cask engineers got together and, wanting to open up the capabilities of Cask Hydrator beyond Java developers, decided to build a transformation that uses user-written Python. Beginning with CDAP release 3.2, the CDAP UI … Read more


Multiple Outputs in CDAP

alia

In CDAP, a MapReduce program can interact with a CDAP dataset by using it as an input or an output. Before CDAP 3.2.0 users could only have single dataset as the output of a MapReduce job. We wanted to extend this capability and allow writing to multiple datasets from MapReduce jobs to support the following … Read more



Announcing CDAP 3.2 – Hydrator and much more!

bhooshan

We are excited to announce the Cask Data Application Platform (CDAP) 3.2 release. This release brings many enhancements to existing CDAP features as well as lays the foundation for upcoming, advanced features—all designed to further simplify data application development. Cask Hydrator CDAP 3.2 introduces Cask Hydrator—a highly functional framework and UI to support self-service batch … Read more


Exploring Apache Cassandra with CDAP

As many of you may know, Cassandra Summit is fast approaching! In preparation, the team at Cask decided to integrate Cassandra and CDAP. Apache Cassandra is an open-source behemoth of a project, one of the most popular databases in the world. It stores the data in a token ring, where each host or node is … Read more