February 18, 2015
With the number of application scenarios for Hadoop rapidly expanding to include in-memory applications based on the Apache Spark framework, adoption of Hadoop in the cloud is rapidly expanding. To address demand for access to Spark, Altiscale today at the Strata + Hadoop World 2015 conference announced that it now supports Apache Spark on the Altiscale Data Cloud.
Built from the ground up to support big data applications in the cloud, Altiscale CEO Raymie Stata said the Altiscale Data Cloud is a vertically integrated cloud that is optimized for both deploying and managing Hadoop clusters on an ongoing basis. What many IT organizations don’t always appreciate about Hadoop is how challenging it can be to upgrade a Hadoop cluster. Not only do all the servers in a Hadoop cluster have to be identical, upgrading those clusters can be a challenge in terms of maintaining a consistent set of configurations.
For those reasons Stata said that more IT organizations have been turning to the cloud to deploy Hadoop applications in production. The issue that many of those organizations will encounter, however, is that general purpose cloud computing environments are not optimized for Hadoop, said Stata. In fact, it was that very issue that led Datameer to unfurl a Big Data analytics application running in the cloud on top of the Altiscale infrastructure-as-a-service (IaaS) platform.
To help secure those applications Altiscale also announced today that it is adding support for Kerberos authentication to improve the overall security of the Altiscale Data Cloud environment.
In general, the Apache Spark framework makes use of YARN (Yet Another Resource Negotiator) to allow applications running on Hadoop to run in memory. Rather than simply treating Hadoop as a massive data repository, YARN makes it more feasible to deploy real-time applications on Hadoop, as opposed to only batch-oriented applications.
A recent study that Altiscale commissioned Forester Consulting to conduct found that reliance on the Altiscale Data Cloud reduced Hadoop job failure rates by over 60 percent. The study also found that there was a 46 percent reduction in the amount of time a Hadoop job took to complete and a 30 percent reductions in wasted compute cycles. The end result, said Stata, is not only enhanced productivity for data scientists, but also reduced IT labor costs.
With more applications heading towards both Hadoop and the cloud solution providers across the channel need to become savvier about where and when Hadoop can be deployed. There’s no doubt that Hadoop is transforming the cost of storing massive amounts of data. But it’s also a platform for developing and deploying a new generation of Big Data applications that just like any other application in the cloud still needs to be managed by organizations with particular sets of expertise.
About the Author(s)
You May Also Like
The Gately Report: Qualys Partners Getting New Channel Program in Early 2024Dec 11, 2023
8x8, AT&T, Cisco, Other Vendors Update Partner ProgramsDec 08, 2023
Meet Channel Futures' Top 20 UC/Contact Center Leaders for 2023Dec 08, 2023
New Gartner UCaaS Magic Quadrant ID's 2023 Industry LeadersDec 07, 2023