The Data Day, A few days: November 9-14 2013

Total Data Integration. PostgreSQL on RDS. And more

And that’s the data day, today.

The Data Day, A few days: October 12-18 2013

Apache Hadoop 2 goes GA. Teradata cuts guidance. And more

And that’s the data day, today.

The Data Day, A few days: September 20-30 2013

Three reasons why Nirvanix failed. And more

And that’s the data day, today.

7 Hadoop questions. Q2: Hadoop infrastructure choices

What is your preferred infrastructure for Hadoop deployments? That’s one of the primary questions being asked in the 451 Research 2013 Hadoop survey. The answer will have significant implications for the future direction of Hadoop.

hadoop-elephant

While one of the primary benefits of Hadoop – low cost data storage – means that for many organisations the primary infrastructure for Hadoop has been commodity hardware, many systems and storage vendors now offer their own dedicated appliances and/or reference architecture for Hadoop.

We expect to see more of these dedicated Hadoop configurations as the incumbent infrastructure vendors look to cash-in on Hadoop adoption and try to add greater value.

We also see some companies exploring the potential for Hadoop in the cloud, as well as hosted deployments, and on virtual infrastructure – although those are arguably in the early stages of technical maturity, and adoption.

survey

Which infrastructure configurations are most popular? That’s one of the things our survey is designed to find out. The early results perhaps unsurprisingly indicate a greater preference for Hadoop being deployed on commodity hardware. However, cloud and virtual deployments have also scored well.

Interestingly, the early results show the preference for Hadoop on cloud infrastructure is significantly higher among respondents that are still in the development and test stage with Hadoop, which supports our anecdotal evidence about the use-cases for Hadoop in the cloud.

In order to get a little more detail on deployment preferences, the survey also asks about the level of consideration, testing and adoption for dedicated Hadoop hardware and Hadoop-as-a-service offerings respectively.

Among the choices in the dedicated hardware category are offerings from DataDirect Networks, Dell, HP, Oracle, IBM, Pivotal, Teradata, Cisco and NetApp.

The choices in the Hadoop-as-a-service category include Altiscale, Amazon EMR (including MapR), MapR on Google Compute Engine, Microsoft Windows Azure HDInsight Service, Mortar Data, Qubole, Rackspace Big Data, SunGard Unified Analytics Services and Treasure Data.

To give your view on this and other questions related to the adoption of Hadoop, please take our 451 Research 2013 Hadoop survey.

The Data Day, A few days: July 11-17 2013

FoundationDB acquires Akiban. Cloudera acquihires Myrrix. And more

And that’s the data day, today.

The Data Day, A few days: July 1-10 2013

Oracle launches Database 12c. Predixion raises $20m. And more.

And that’s the data day, today.

The Data Day, A few days: June 11-25 2013

A bumper round-up of the past 14 days’ data-related news

* Cisco announced its intention to acquire Composite Software.

* Software AG acquired Apama.

* TIBCO Software acquired StreamBase Systems.

* Cloudera appointed Tom Reilly as Chief Executive Officer and Mike Olson as Chief Strategy Officer and Chairman of the Board.

* Sears Holdings named Jeff Balagna Chief Executive Officer of MetaScale

* Ex-Yahoo CTO launched Altiscale, hardcore Hadoop as a service.

* SpaceCurve raised a $10M Series B round of financing.

* Sqrrl announced general availability of Sqrrl Enterprise.

* GE launched Predictivity services, supported by supported by Proficy Historian HD.

* Datameer announced Datameer 3.0.

* Oracle announced the general availability of MySQL Cluster 7.3.

* MemSQL announced the upcoming availability of MemSQL 2.1.

* Continuuity announced the release of Weave, a new open source project that enables Java developers to rapidly build scalable, distributed applications on YARN.

* RainStor adds security, text search features to database complement for Hadoop.

* Composite Software introduced version 6.2 SP3 of its Composite Data Virtualization Platform

* TokuDB launched TokuMX.

* Terracotta announced the immediate availability of Terracotta Universal Messaging.

* HP united its data management assets under HAVEn brand.

* Hortonworks and Red Hat announced an engineering collaboration around Hadoop.

* Rackspace Hosting’s ObjectRocket Database as a Service entered into a strategic agreement with 10gen.

* Simon Phipps posted State Of The Sea Lion – June 2013.

* Netflix announced that its Genie Hadoop-aaS management software is now open source

* Storm-YARN released as open source.

* Big Data arrived at the Oxford English Dictionary

And that’s the data day, today.

The state of the MySQL ecosystem

I’ll be on a whistle-stop tour of California next week, including two presentations at the Percona Live MySQL Conference and Expo.

On day one at 1:20pm I’ll be presenting CAP Theorem: Two out of three ain’t right, in which I will be challenging the popular ‘two out of three’ explanation of CAP Theorem, examining the evidence from a variety of experts, including Dr Eric Brewer.

Then on day three at 9:00am I’m very honoured to be providing a keynote presentation, The State of the MySQL Ecosystem.

Here’s an overview of the presentation:

It is now over three years since Oracle acquired MySQL along with Sun Microsystems. Fears for the open source database’s survival appear to have been misplaced as Oracle has increased investment in MySQL development. At the same time, a thriving ecosystem of potential alternatives and complementary products has emerged to provide MySQL users with greater choice in terms of both functionality and support. As a result of that choice, we are seeing the increasing independence of the ecosystem of MySQL-related products and services from MySQL itself – both in terms of a commercial product, and also a development project. The continued maturity of vendors such as Percona and SkySQL, as well as the formation of the MariaDB Foundation, has the potential to accelerate that trend. The MySQL ecosystem is far from fragmenting, but 451 Research’s updated survey of database users indicates that the center of gravity has begun to shift towards an increased state of independence.

Beyond our own presentations of course there is a huge number of presentations of interest during the event from both users and vendors, and I’m looking forward to attending as well as presenting.

If you haven’t already registered you can do so now and get 15% off by using the code “SpeakMySQL” at checkout. Register here.

The Data Day, A few days: April 9-12 2013

Funding for MarkLogic and ParElastic. And more

And that’s the data day, today.

The Data Day, Two days: February 7/8 2013

Teradata results. Funding for DataXu. The chemistry of data. And more.

And that’s the data day, today.