Advanced Computing in the Age of AI | Wednesday, April 24, 2024

Calpont Announces InfiniDB for Hadoop 

Calpont Corporation today announced at the Strata Conference + Hadoop World, InfiniDB for Apache Hadoop. InfiniDB for Apache Hadoop complements organizations’ Hadoop infrastructure with a columnar architecture that is optimized for rapid and easy access to highly dimensional data and has the flexibility, scale and horsepower to work with massive data sets.

Hadoop and HDFS have been a great way for organizations to capture, process and efficiently store copious amounts of unstructured data. But there is a difference between collecting data and querying data.

“InfiniDB for Apache Hadoop differentiates itself by empowering organizations to actively use, engage and execute high-performance analytics directly within an Apache Hadoop cluster against all of the data. InfiniDB for Apache Hadoop incorporates SQL query language and data structures for Hadoop deployments without constraining or bottlenecking performance. This is simply not possible with any other technology or platform on the market,” said Jim Tommaney, chief technology officer at Calpont.

Highlights of InfiniDB for Apache Hadoop include:

  • InfiniDB for Apache Hadoop provides structured query performance for both large dimensional or non-dimensional data sets where speed and reliability are paramount to business execution.
  • No need to custom code or learn new programing languages for Hadoop, Big Data developers can use the familiar and rich SQL syntax to develop high performance analytic queries.
  • InfiniDB is the first MySQL storage engine to implement the SQL standard Analytical Functions to enable new in-database analytics horsepower and further leverage existing developers, analytic tools, and solutions.
  • As part of the InfiniDB platform, organizations have the ability to run the same technology on their choice of local disk, GlusterFS, AWS, and HDFS without any compromise of performance, scale or flexibility.
  • InfiniDB incorporates a library of embedded frameworks, job scheduling and cluster resource management features that are battle-proven for high performance Big Data analytics.

 

More organizations are using Hadoop for large scale structured and unstructured data. However, the sheer volume of data collected often results in a Hadoop implementation that is no longer performing at the desired speed for analysis. Unlike parallel, column-oriented databases, such as InfiniDB for Apache Hadoop, many Hadoop implementations don’t distribute query processing and have to write intermediate query results to local data structures as part of the query execution process that can result in bottlenecks. In addition to slowing query response, this results in a mishmash of custom-coded infrastructure that can limit future technology scaling to support growing business requirements.

The InfiniDB for Apache Hadoop is optimized to do faster queries and scale as processing demands grow. InfiniDB is built for Big Data analytics, with a massively parallel processing (MPP), column-oriented architecture that is designed specifically for read intensive analytic applications and data warehouses that consume large amounts data. InfiniDB scales performance linearly with hardware, is easy to deploy and maintain, and requires no special database tuning, indexing or materialized views, making it ideally suited for Big Data analytics. Results of benchmarking tests that demonstrate the robust and consistent performance are available on the InfiniDB for Apache Hadoop webpage.

EnterpriseAI