Updated database landscape graphic

One of the most popular pieces I have produced since joining 451 is not a research report or presentation but the database landscape graphic that accompanied our NoSQL, NewSQL and Beyond report.

We’ve seen it crop up in other presentations and websites – sometimes even with attribution ;-)

We actually updated the image to accompany our more recent report MySQL vs. NoSQL and NewSQL: 2011-2015 but I realised that I haven’t made that newer version more generally available. So here it is:

We wouldn’t claim it to be perfect. There’s a whole new breed of data platform-as-a-service providers that have emerged in recent months that will need to be added, if we can find space for them.

Meanwhile there are a group of database vendors that have also emerged that don’t easily fit into the segments we’ve created: companies like Drawn to Scale, FoundationDB, Aerospike and Splice Machine.

But since the original graphic continues to be popular, I thought I’d share the latest iteration as well. Any feedback always welcome

The Role of NoSQL and Graphs in the Total Data Landscape

I’ll be flying over to San Francisco at the weekend to attend and present at GraphConnect, which takes place at the Hyatt Regency on November 5 and 6.

Specifically, I’ll be giving a presentation on the role of NoSQL and graphs in the total data landscape, subtitled Big Data, Total Data, NoSQL, Graph, at 11.40am on November 6.

Here’s the overview: The database market is changing rapidly with new approaches emerging that provide an alternative to the relational data model. This presentation examines the drivers behind the rise of NoSQL data stores and, in particular graph databases, focusing on their use-cases and adoption trends, and exploring where graph database fit in the world of NoSQL, NewSQL, and big data.

I’ll also be moderating a panel at 5.05pm on November 6 comprised of enterprise companies that use graph databases in production. This panel includes 3-4 technical leads from Accenture, Cisco and Telenor Norway that will discuss what it takes to put large scale graph databases into production.

GraphConnect looks like a great event for anyone with experience with, or just interest in, graph databases. Keynotes will be provided by Emil Eifrém, CEO, Neo Technology, and James Fowler, co-author of Connected: The Surprising Power of Our Social Networks and How They Shape Our Lives.

The full agenda can be found here, and it’s not too late to register, here.

Forthcoming webinar: Choosing a Next-Gen Database

On Tuesday, November 13 at 12pm ET I’ll be taking part in a webinar in association with ScaleBase on the subject of Choosing a Next-Gen Database: The new world order of NoSQL, NewSQL & MySQL.

With the database market becoming increasing complex and changing on an almost daily basis, I’ll be providing an overview of this ever-changing market: discussing the benefits and drawbacks of NoSQL, NewSQL & MySQL databases and exploring real-life use cases for each.

Joining me will be Doron Levari and Paul Campaniello, both from ScaleBase, who will be discussing specific use cases of ScaleBase’s Data Traffic Manager, which is designed to enable next generation applications that require big data transactional processing, without changing the existing infrastructure.

For full details, and to register for the event, click here.

The Data Day, Two days: September 27/28 2012

Quantcast launches HDFS alternative. Glassbeam eyes funding. And more.

And that’s the Data Day, today.

NoSQL LinkedIn Skills Index – rebooted

I decided to reboot our analysis of NoSQL skills, according to LinkedIn search results.

There are two main reasons for doing so: the first iteration did not take in enough of the various NoSQL projects; and I have – with help – worked my way around the eccentricities of LinkedIn search to produce a more accurate result for Apache Cassandra.

The analysis therefore now incorporates a wider spectrum of NoSQL projects, the top ten most popular of which are displayed below. The chart illustrates the number of LinkedIn member profiles mentioning each of the NoSQL projects:

The main change from the previous results is the promotion of Apache Cassandra, thanks to our better search string, while MarkLogic is the first of our new additions to make the top ten.

What hasn’t changed is the dominance of MongoDB, which is way-ahead of all the others. While I am not breaking out growth percentages versus previous counts due to the reboot, it is fair to say that MongoDB is outpacing many of its rivals. Neo4j and DynamoDb are also growing particularly well.

In fact, as can be seen from the chart below, MongoDB accounts for 43% of all mentions of NoSQL technologies in LinkedIn profiles, according to our sample.

The Data Day, Three days: August 15/16/17 2012

Symantec teams CFS with Hadoop. Informatica Cloud. And more

And that’s the Data Day, today.

NoSQL Road Show heads for Basel

The latest leg of the NoSQL Road Show will take place in Basel, Switzerland at the end of August.

The schedule has been expanded compared to previous events, and now includes presentations related to Apache HBase, MongoDB, Dynamo, Hadoop and Neo4j, in addition to Riak.

I will once again be presenting on the topic of NoSQL, NewSQL, Big Data…Total Data – The Future of Enterprise Data Management, although since is the first NoSQL Road Show I have attended since the publication of our MySQL vs. NoSQL and NewSQL, I’ll be adding in some new content related to market sizing and growth estimates, as well as competitive trends.

The event takes place August 30 at Ramada Plaza Basel. Full details and registration information are available here.

The Data Day, Two days: August 13/14 2012

Datomic calls time on RDBMS. Actian offers $154m for Pervasive. And more

And that’s the Data Day, today.

Updated: NoSQL database skills popularity, according to LinkedIn

A few weeks ago I updated our previous analysis of LinkedIn search results to take us up to the end of Q2 2012, but for one reason or another I didn’t get around to posting the results. As you can see from the results, there isn’t anything particularly controversial to say about them, other than that MongoDB continues to extend its lead over the rest in terms of the number of mentions in LinkedIn member profiles.

One thing that isn’t immediately clear from the chart is that Neo4j overtook Riak in the second quarter to claim sixth place by a narrow margin. Also, Apache HBase has consolidated its third place position having overtaken Apache CouchDB in the first quarter.

Neo4j’s ascendency is due to it having the fastest growth rate of all the NoSQL projects in Q2, according to our search results. The number of Neo4j mentions in LinkedIn profiles grew 65.8% between our March and June searches, followed by MongoDB (59.5%) and Redis (59.1%).

As you can see, I have also added Amazon’s DynamoDB to the list for this iteration. It will be interesting to see how it performs going forward. As usual, it should be noted that the results for Apache Cassandra are probably disproportionately low since we have to search for the full phrase in order to avoid including people called Cassandra.

I have also adjusted once again how we assess Couchbase/Membase, having belatedly realised that LinkedIn supports boolean searches. As of this point forward, the result represents a search for Membase OR Couchbase.

Once again, we would also note that this is not meant to be a comprehensive analysis, but rather a snapshot of one particular data source.

A different perspective on NoSQL vendor traction

Amid the reporting of 10gen’s $42m funding round yesterday a specific claim about 10gen’s success to date caught my eye.

“10gen says it’s got about half the NoSQL market wrapped up already. This is based on… indicators, such as how often LinkedIn profiles mention MongoDB.”

While our own analysis of LinkedIn profiles did indeed indicate that 10gen has a sizeable lead over its NoSQL rivals, this only accounts for the NoSQL market *to date*, and the NoSQL vendors have barely scratched the surface.

451 Research recently estimated that NoSQL software vendors between them generated revenue of just $20m in 2011 (less than half 10gen’s latest funding round), and that the market will grow at a CAGR of 82% to reach $215m by 2015.

10gen is well placed to capitalize on this growth given its customer and revenue traction to date. While we are not breaking out individual revenue estimates the chart below shows revenue and customer estimates for 10gen, Basho, Couchbase and DataStax, with the scale adjusted to fit on a single chart.

The chart appears to confirm 10gen’s claim to have half the NoSQL market wrapped up, at least in terms of customers. However, what this chart doesn’t address is the relative strategy stage of each vendor in terms of customer traction.

10gen has done extremely well in growing a large customer base via its focus on ease of developer adoption, and is now turning its attention to the sort of capabilities required by traditional enterprises.

Other vendors in the NoSQL space have done precisely the opposite: starting with enterprise capabilities and now turning their attention to greater ease of use and developer adoption.

We can begin to get a sense of how these strategies are playing out if we add a column for revenue per customer (again re-scaled). Here you can see that 10gen is actually doing less well than some of its rivals.

The size of the MongoDB installed base gives 10gen a big opportunity to aim at, but others are arguably ahead in terms of traction with enterprise customers. That’s why our market sizing methodology is specifically designed to take multiple (sometimes conflicting) factors into account in creating an estimate for each vendor, as well as the aggregate total.

10gen may well have about half the current NoSQL market wrapped up but this market has really only just begun.