CDAP Blog

Monitoring Key Hadoop Operational Statistics using CDAP

bhooshan

The Cask Data Application Platform (CDAP) is the first Unified Integration Platform for Big Data. It provides users with higher-level abstractions and APIs over complex, low-level systems for building Big Data applications. It does the heavy lifting involved in integrating various platforms in the Apache Hadoop ecosystem to provide a single end-to-end platform. To … Read more


CDAP 4 – Introducing Cask’s Big Data App Store, Cask Market, plus Cask Wrangler, a new UI and more

Vinisha Vyasa

We are very happy to introduce the general availability of the 4th generation of Cask’s flagship product – CDAP 4. This release builds on what we learned over the past few years from our users and the community. This post summarizes the major enhancements in CDAP 4, namely, New & Revamped User Experience, Cask’s “Big … Read more


Integrating CDAP with Microsoft Azure HDInsight

We recently announced the integration of CDAP with the Microsoft Azure HDInsight platform. This post will give a behind-the-scenes look at this integration. First, a bit about the integration itself. Azure HDInsight is an Apache Hadoop and Spark distribution powered by the cloud. This means that it handles any amount of data, scaling from terabytes … Read more


Cask Tracker Enhanced: Metadata Taxonomy and Data Usage Analytics in CDAP 3.5

Yue Gao and Riwaz Poudyal

Cask Tracker is a self-service CDAP Extension that automatically captures rich metadata and provides users with visibility into how data is flowing into, out of, and within a Data Lake. Tracker was first introduced in CDAP v3.4. Tracker v0.2 has just been released along with CDAP 3.5 and packs a ton of new features. Dataset … Read more


CDAP 3.5 – Enterprise Security, Drag-and-Drop Spark Streaming, and much more!

Sagar Kapare

I am very excited to announce the release of Cask Data Application Platform (CDAP) version 3.5. The focus for CDAP 3.5 is security, with a number of significant new capabilities added to the platform, in addition to major improvements to the Extensions, Cask Hydrator and Cask Tracker. CDAP 3.5 introduces authorization to the platform with … Read more


Powering BI with ODBC Connectors for CDAP

bhooshan

Open Database Connectivity (ODBC) is the de facto standard API for accessing data stored in relational databases. ODBC drivers allow applications across a variety of platforms (especially non-Java) to access relational databases in a manner independent of the implementation and the operating system. In this post we will discuss the integration between CDAP Datasets and Tableau … Read more


Announcing CDAP Release 3.4: Introducing Tracker, Next-gen Hydrator, Enhanced Spark support, and much more!

Rohit Sinha

I am very happy to announce the general availability of our flagship product, the Cask Data Application Platform (CDAP), version 3.4. This release introduces a fresh new look for Cask Hydrator, and improvements to it that extend beyond data ingestion use cases, such as building aggregations and performing data science on the ingested data. The … Read more