Drupal contributor statistics

I recently extracted some data from the Drupal project's CVS and Git logs to see how the number of code contributors and total contributions have changed over time. If there was any doubt of our continual growth, the resulting charts demolish it.

Commits
Aggregated results from core and contributed modules.
Committers
Aggregated results from core and contributed modules.

As can be seen from the graphs, there is a pretty big spike in commit activity post-Git migration.

Drupal 7 commit history

Drupal 7 commit history - absolute
We opened the Drupal 7 development branch in February 2008, and released Drupal 7.0 in January 2011. This graph shows the stacked commit history from beginning to end. I appointed Angie as my Drupal 7 co-maintainer in August 2008 after having been the sole committer for 7 months. The peak around August 2009 (the highest peak) was the first attempted Drupal 7 code freeze. The momentum steadily built up towards the initial code freeze date. Interestingly, we remained most productive during the extended code freeze period ... more code freezes are better than one code freeze? ;-)
Drupal 7 commit history - relative
I averaged at 3.6 commits per day, whereas Angie's average is 2.6 commits per day (including weekends and holidays).

Acquia Hosting: an update after 8 months of Drupal hosting

Roughly 8 months ago at DrupalCon Paris, we launched Acquia Hosting. In this blog post, I wanted to give a quick update on where we are after 8 months.

Current status

For those who don't know, Acquia Hosting is a highly-available cloud-based hosting platform tuned for Drupal performance and scalability. From a technology point of view, we've built tools to automatically launch multi-server hosting environments optimized for Drupal. It is built on Amazon Web Services (i.e. Amazon EC2, Amazon S3, etc) using Open Source components such as Varnish, Puppet, GlusterFS, NginX and more. If you are interested in the technical details, I highly recommend watching Barry Jaspan's DrupalCon San Francisco presentation on the challenges of hosting Drupal on AWS -- I'm biased, but it is the best technical presentation that I've seen on hosting websites on Amazon Web Service (AWS). Highly recommended. The presentation is the result of 2.5 years of experience building products exclusively on Amazon Web Services and having to maintain close to 200 EC2 instances.

At Acquia, we're all very proud of what we've built. For example, we were recently able to have a new, enterprise-scale Acquia Hosting customer online only a few days after they first contacted us. It takes most hosting companies weeks or months to roll out, configure and tweak all the servers required to host a high-traffic traffic sites like this one. In just a few days, we scaled past the limits of their previous hosting provider and flawlessly served 3 million page views per hour (i.e. 830+ page views per second or 5000+ HTTP requests per second -- yes, Drupal scales). I hope the customer will allow us to write-up a detailed case study at some point. It is a real success story for Drupal, Acquia Hosting, Amazon Web Services and cloud computing in general: incredible time to market, great performance and scalability. We've come a long way since we started working on a Drupal hosting product about a year ago.

The way we started work on Acquia Hosting is the way we have continued: with a very strong focus on engineering. Our first area of focus was on reliability. The results of this were: providing multiple, redundant web nodes; real-time database replication; backups; monitoring infrastructure (we track 25+ system parameters); customer isolation, and so on. Next, we focused our efforts on improving Acquia Hosting's performance by adding tools like Varnish for page caching; customer isolation; reorganizing parts of our underlying architecture; lots of tweaking to Apache, PHP and MySQL; and repeated rounds of realistic load testing. Along the way, we developed deployment tools to make it easy to roll-out and automatically configure our customers' EC2 instances -- it takes just a matter of minutes to upgrade a site's capacity.

Roadmap

Considering our costs and other metrics eight months into the hosting business adventure, the real value of our hosting offering comes not from the technology alone, but rather from our support team's work while getting customers' sites online and helping them day in, day out. Once servers are provisioned for a new site, getting customers up-and-running involves detailed site audits (making sure they don't have core hacks, analyzing their site architecture, etc.), teaching them how the Acquia Hosting environment works, helping them learn to best leverage clusters of servers, doing load testing, and helping them get over performance bottlenecks (slow or excessive SQL queries, expensive uncached Views or blocks, etc.). At the end of the day, our team's deep knowledge of Drupal and our technology stack are the essence and ultimate value-proposition of our Drupal hosting offering.

Going forward, a top priority is to make the process of getting new customers online easier for us and better for them. Among other things, that means developing more "self-service"-style systems, improved customer dashboards and documentation, and streamlined, focused support operations to make sure our customers are getting their questions answered and their problems fixed in the shortest time possible so they can worry about their business and not their websites.

Free Acquia Hosting program

We also announced a free Acquia Hosting program. To help support the Drupal community, we give free Acquia Hosting to sites for non-profit groups that promote Drupal use and adoption. We're now hosting 25 community websites including Drupal Edu, SpreadDrupal, Drupal Dojo, Drupal Catalan, Design 4 Drupal Boston and more. There are about 50 more Drupal community sites in the backlog waiting to get setup with an Acquia Hosting account. Yet another reason to make it easier to get new users and customers up and running!

State of Drupal presentation (April 2010)

Two weeks ago at DrupalCon San Francisco I gave my traditional state of Drupal presentation. A total of 6000 people watched my keynote live; 3000 were present at DrupalCon, and another 3000 watched the live video stream. Nonetheless, a lot of people asked me for my slides. So in good tradition, you can download a copy of my slides (PDF, 48 MB) or you can watch a video recording of my keynote on archive.org.

Keynote backstage
Picture taken backstage while waiting to go on stage for my keynote.
Keynote view
Picture taken after my keynote just before packing up my laptop.

State of Drupal presentation (September 2009)

Two weeks ago at DrupalCon Paris, I gave my traditional state of Drupal presentation. The video of the presentation is available from archive.org, and you can download a copy of my slides (PDF, 8 MB) as well.

I don't want to give away the spoiler, but essentially, the state of Drupal is strong. :) We should be really proud of what we have accomplished with Drupal 6, and what we're about to accomplish with Drupal 7. In the presentation, I also talk about what it means for Drupal to grow up, and what the next phase of our life will most likely look like.

Drupal can help pay for your rent

The demand for Drupal talent continues to exceed the supply! What will we do about it?

Job trends july
Source: Indeed.com.

Drupal 6 growth

Drupal growth

The graph above is made based on the project usage statistics collected on drupal.org. As ever with statistics of this sort, they don't tell the whole story. This is because only sites running the update status module report data back to drupal.org. This module is part of Drupal 6 and the installer prompts the user to enable the module when Drupal is first installed. It is not required to enable this module. People upgrading from Drupal 5 aren't even prompted to enable it. Plus, many Drupal sites are hidden behind corporate firewalls. As a result, we don't really know how many Drupal 6 sites there are.

Either way, based on the growth data that we do have available, we can predict that we will near 240,000 Drupal 6 sites by January 2010. See the black trendline on the graph. The R-square represents the variability in the data set that is accounted for by the prediction model. Its value indicates how likely the predicted values are -- the closer to 1.0, the better. If Drupal 6 continues to grow like it did the past 9 months, our prediction should be pretty accurate. It would mean that the number of Drupal 6 sites will double over the next 9 months. Not bad.

I always believed that the best way to grow Drupal is to make the software better, and that is why we continue to work hard on Drupal 7. But until Drupal 7 is released, there are a lot of things that we can do to help people get started with Drupal 6 -- from offering .zip-files instead of .tar-files, to launching the drupal.org redesign, to sharing more Drupal 6 success stories, and more. There are a lot of barriers that could be removed and that would result in faster growth.

Once in a while, it is good to make lists. You're all invited to share your list of "things you think we should do" in the comments of this post. I recommend that you prioritize your list so the most important item is at the top. If you are actively working on any of the items on your list, let us know too. Bonus points for ideas that have high impact, require minimal effort and benefit the Drupal community at large. Penalty points for people that recommend Drupal 7 features based on self-interest. Ready, steady, go!

Faster is better

Through a presentation from Nicole Sullivan, a former member of Yahoo’s Exceptional Performance Team and co-author of O’Reilly's upcoming book on performance optimization, I came across the following data points:

  • Amazon: 100 ms of extra load time caused a 1% drop in sales. (Source: Greg Linden, Amazon)
  • Google: 500 ms of extra load time caused 20% fewer searches. (Source: Marrissa Mayer, Google)
  • Google: trimming page size by 30% resulted in 30% more map requests. (Source: Marrissa Mayer, Google)
  • Yahoo!: 400 ms of extra load time caused a 5 to 9% increase in the number of people that clicked "back" before the page even loaded. (Source: Nicole Sullivan, Yahoo!)

While we all knew this was true (and while I'd like more detail on these tests), it is nice to have some quantitative data from different sources. Long story short: even the smallest delay kills user satisfaction. Let's make Drupal even snappier! (Hat tip: Peter Van Dijck)

State of Drupal presentation (August 2008)

Last week at DrupalCon Szeged I gave my traditional state of Drupal presentation. The video of the presentation is provided below, and you can download a copy of my slides (PDF, 11MB) as well.

The presentation discusses the results of the recent survey that I conducted; the survey ran for 30+ days and collected more than 1300 responses so it should provide a good idea of the community's current thinking. I'll provide more color and details about the survey results in a number of follow-up posts.

Source: archive.org.

Google insights on Drupal

Recently, Google launched Google Insights. Like with Google Trends, you can just type in a search term to see search volume patterns over time, as well as the top related and rising searches. You’ll also have the ability to compare search volume trends across multiple search terms, categories (commonly referred to as verticals), geographic regions, or specific time ranges. Great for marketing people.

Below are some examples specific to Drupal ...

Google Insights - Search volume for Drupal in the world
The numbers on the graph reflect how many searches have been done for a particular term, relative to the total number of searches done on Google over time. See Google Insights results for Drupal.
Google Insights - Search volume in China
In China, Wordpress is winning hands down. See Google Insights results for Drupal.
Google Insights - Search volume for Belgium
In my home country, Belgium, Drupal is almost as strong as Wordpress but not nearly as strong as Joomla. See Google Insights results for Drupal.
Google Insights - Regional Drupal interest by country
Regional Drupal interest by country. Google uses the term 'search volume index' for these heatmaps, meaning that they normalized the data by the total traffic from each respective region. In other words, just because two regions show the same percentage for a particular term doesn't mean that their absolute search volumes are the same. See Google Insights results for Drupal.
Google Insights - Regional Drupal interest in USA
In the US, the west coast beats the east coast. Based on 'search volume index'. See Google Insights results for Drupal.
Google Insights - Regional Drupal interest by city
Regional Drupal interest by city. Based on 'search volume index'. See Google Insights results for Drupal.
Google Insights - Drupal search terms
The top search on Drupal -- great for marketing people. Breakout means that the search term has experienced a change in growth greater than 5000%. See Google Insights results for Drupal.

Updates from Dries straight to your mailbox