MemSQL Introduces Seamless Spark Connectivity for Enterprise Deployments
MemSQL Spark Connector Enables Real-Time Data Exchange Allowing Enterprises to Operationalize and Streamline Apache Spark Deployments
SAN FRANCISCO, February 10, 2015 - MemSQL, the leader in real-time databases for transactions and analytics, today announced performance connectivity to Apache Spark with its MemSQL Spark Connector, allowing enterprises to operationalize and streamline Spark deployments. The powerful combination of an in-memory database from MemSQL and Spark's in-memory processing framework gives enterprises immediate access to transactions, analytics, and data exploration—opening up new opportunities for revenue and improved customer experiences. The MemSQL Spark Connector is also available as open source, providing developers the ability to adapt it to their needs.
MemSQL Introduces Seamless Spark Connectivity for Enterprise Deployments - Click to Tweet
Traditionally, data has been siloed and stored in separate data stores—taking up lengthy and complex transfer time before data is made available to other systems. With the MemSQL Spark Connector, operational data is immediately available for analysis in Spark, giving enterprises and data exploration teams access to real-time data. Further, results from Spark operations can be quickly placed into production by transferring back to the MemSQL persistent and transactional database.
"Spark is an impressive execution engine for data exploration, and enterprises can now operationalize Spark results with persistent databases," said Eric Frenkiel, CEO and co-founder, MemSQL. "The integration between MemSQL and Spark represents the best of both worlds—giving enterprises an ability to automate data-driven decision making, perform real-time operational analytics and detect and respond to anomalies in real-time."
The MemSQL Spark Connector utilizes the in-memory and distributed architectures of both MemSQL and Spark for high performance parallel throughput. Regardless of the number of nodes in each cluster, the MemSQL Spark connector allows rapid transfer between the database and the Spark RDD (resilient distributed dataset).
“Users continue to cite data integration as an ongoing challenge, and the luxury of conducting time-intensive and rigid extract/transform/load (ETL) operations is diminishing,” said Nik Rouda, Senior Analyst, Enterprise Strategy Group. “Customers expect data solutions to be capable of achieving high performance analysis, with more complicated integration, and the MemSQL Spark Connector helps fill these needs.”
Enterprises now have access to real-time analytics on changing datasets, and an elegant and fluid way to interact and consume results from Spark. By simplifying an organization's infrastructure, companies can reduce latency, increase reliability and ensure the productivity and efficiency of its data scientists and developers. As a software solution, the combination of tools offers deployment flexibility as well as MemSQL and the MemSQL Spark Connector can be deployed on-premises or in the cloud.
“We are excited to see developments such as the MemSQL Spark Connector enhance the breadth of use cases with Apache Spark,” said Matei Zaharia, CTO at Databricks and Vice President of Apache Spark. “This is a great example of seeing the usability, performance, and library capabilities of Spark expand with MemSQL to meet the rapid adoption and interest in large-scale data processing.”
To download MemSQL Spark Connector, visit the MemSQL Github site.
Read the technical blog post here.
MemSQL is the leader in real-time databases for transactions and analytics.
As a purpose built database for instant access to real-time and historical data, MemSQL uses a familiar SQL interface and a horizontally scalable distributed architecture that runs on commodity hardware or in the cloud.
Innovative enterprises use MemSQL to better predict and react to opportunities by extracting previously untapped value in their data to drive new revenue.
MemSQL is proven in production environments across hundreds of nodes in high velocity Big Data environments.
Based in San Francisco, MemSQL is a Y Combinator company funded by prominent investors including Accel Partners, Khosla Ventures, First Round Capital and Data Collective. Follow us @MemSQL or visit at www.memsql.com.