Cloudera and Hortonworks: The SimilaritiesCloudera as well as Hortonworks are both built upon the same core of Apache Hadoop. As such, they have more similarities than differences.
- Both offer enterprise-ready Hadoop distributions. The distributions have stood the test of time as well as consumers, ensuring security and stability. Besides, they provide paid training and services to familiarize the newcomers treading the path of Big Data and Analytics.
- Both have established communities that actively participate and help with the problems faced as well as demonstrations needed.
- Both distributions have master-slave architecture.
- Both have a shared-nothing computing framework.
- Both support MapReduce as well as YARN.
Cloudera vs. Hortonworks: The DifferencesThat being said, the differences are the ones that play a deciding role of choosing one vendor over the other. Broadly, Cloudera and Hortonworks differ in the following aspects:
- Cloudera has announced that its long term goal is to become an “enterprise data hub,” thus diminishing the need of data warehouse. Hortonworks, on the other hand, remains firmly a provider of Hadoop distro, and has partnered with data warehousing company Teradata.
- While Cloudera CDH can be run on windows server, HDP is available as a native component on the windows server. A Windows-based Hadoop cluster can be deployed on Windows Azure through HDInsight Service.
- Cloudera has a proprietary management software Cloudera Manager, SQL query handling interface Impala, as well as Cloudera Search for easy and real-time access of products. Hortonworks has no proprietary software, uses Ambari for management and Stinger for handling queries, and Apache Solr for searches of data.
- Cloudera has a commercial license, while Hortonworks has open source license. Cloudera also allows the use of its open- source projects free of cost, but the package doesn’t include the management suite Cloudera Manager or any other proprietary software.
- Cloudera has a free 60-day trial, Hortonworks is completely free.
* May contain third party collection information
**Information collected from best practices articles