CDAP Blog

Better collaboration and productivity: Cloud-based software development at Cask

Over the last few years, the popularity of cloud-based software development has risen dramatically, along with the need for sharing development assets and resources within and across organizations. Containers and open source have simplified the sharing and cloning of code and entire dev/test environments, taking efficiency, collaboration and productivity of product engineering organization to new … Read more



Overcoming Big Data Integration Challenges

Enterprise challenges Hadoop has emerged as the leading technology to solve a number of big data use cases. However, enterprises needing to solve their business problems often need to piece together different technologies to build a solution. Each component in the Hadoop technology stack is infrastructure focused and purpose-built to solve a unique set of … Read more


CDAP 4.1 – More Enterprise-Grade Hardening, Pre-Built Solutions and Enhanced UX

Nishith Nand

We are happy to announce the release of Cask Data Application Platform (CDAP) version 4.1. This new release brings with it some major enhancements and significant new capabilities in the platform, as well as new, ready-to-use solutions offered via Cask Market. CDAP 4.1 improves security by allowing fine grained secure impersonation. It introduces replication so … Read more


Monitoring Key Hadoop Operational Statistics using CDAP

bhooshan

The Cask Data Application Platform (CDAP) is the first Unified Integration Platform for Big Data. It provides users with higher level abstractions and APIs over complex, low-level systems for building  Big Data applications. It does the heavy lifting involved in integrating various platforms in the Apache Hadoop ecosystem, to provide a single end-to-end platform. To … Read more


Cask Tracker Enhanced: Metadata Taxonomy and Data Usage Analytics in CDAP 3.5

Yue Gao and Riwaz Poudyal

Cask Tracker is a self-service CDAP Extension that automatically captures rich metadata and provides users with visibility into how data is flowing into, out of, and within a Data Lake. Tracker was first introduced in CDAP v3.4. Tracker v0.2 has just been released along with CDAP 3.5 and packs a ton of new features. Dataset … Read more




Learning CDAP with Elasticsearch

Elasticsearch is a popular search engine based on Apache Lucene™. Unlike relational databases, Elasticsearch stores information in documents; each document has a type (with a mapping) that gives information about its schema, and similar documents are stored together in an index. Elasticsearch even allows time-based indices, so documents can be stored with other records created … Read more


Join us for the 2nd Big Data Application Meetup

Henry Saputra

Cask is proud to host the second Big Data Application Meetup on August 19, 2015 at Cask HQ in Palo Alto. By sponsoring and promoting knowledge-sharing and community-building through the Big Data Application Meetup, Cask continues to take lead in promoting technologies and best practices used to build big data applications. For the second meetup, we have … Read more