Big Names Create Big Database
Big Names Create Big Database
Written by Kay Ewbank   
Friday, 28 March 2014

Facebook, LinkedIn, Twitter and Google have cooperated on the creation of a custom version of MySQL designed specifically for use in apps that need to scale massively.

WebScaleSQL is a custom version of MySQL designed for large scale web applications. The changes the companies have made to the database are to be made available as open source, and the information is being shared with the main MySQL project.



Writing on the project’s website, the WebScaleSQL contributors said:

“we know we’re not the only ones who are trying to solve these particular challenges. So we will keep WebScaleSQL open as we go, to encourage others who have the scale and resources to customize MySQL to join in our efforts.”

In a blog post Steaphan Greene of Facebook said that the goal in launching WebScaleSQL is

“to enable the scale-oriented members of the MySQL community to work more closely together in order to prioritize the aspects that are most important to us.”

Changes to WebScaleSQL compared to the main branch of MySQL include improvements to the performance with buffer pool flushing improvements; optimizations to certain types of query including prefix index queries; and support for NUMA interleave policy.

The optimization to prefix index queries is to get them to skip cluster index lookup when possible. Currently InnoDB will always fetch the clustered index for all prefix columns in an index, even when the value of a particular record is smaller than the prefix length. This change optimizes that case to use the record from the secondary index and avoid the extra lookup.

NUMA (Non-Uniform Memory Access) supports hardware with multiple system buses, each with a small set of processors where each group of processors has its own memory. Each CPU can access memory associated with the other groups, and each group forms a NUMA node. MySQL has traditionally had ‘issues’ dealing with NUMA nodes – you can read a good summary in a blog post by  MySQL architect Mikael Ronstrom’s blog or on Jeremy Cole’s blog.

The WebScaleSQL option lets you set startup options either to flush and purge buffers and caches, or to run Mysql with its memory interleaved on all CPUs.

WebScaleSQL also includes new features for working at web scale, such as super_read_only, and the ability to specify sub-second client timeouts.

If you are interested in contributing to the project, WebScale SQL is on GitHub.


More Information

WebScaleSQL site

WebScale SQL on GitHub

MySQL Server and NUMA architectures

Related Articles

MySQL 5.6 Released

InfiniSQL - Try It For Size

SQLFire - Speeds and Scales


To be informed about new articles on I Programmer, install the I Programmer Toolbar, subscribe to the RSS feed, follow us on, Twitter, Facebook, Google+ or Linkedin,  or sign up for our weekly newsletter.


blog comments powered by Disqus



Introducing Arduino Create

Arduino Create, a new platform that will replace the existing Arduino development platform is out of private beta and open to all Arduino devs, although there is waiting list to join in.

Data Science Curriculum on edX

Data Scientists are in high demand - and now there's another way to earn relevant accreditation via the Microsoft-designed Data Science Curriculum offered on the edX platform.

More News

Last Updated ( Friday, 28 March 2014 )

RSS feed of news items only
I Programmer News
Copyright © 2016 All Rights Reserved.
Joomla! is Free Software released under the GNU/GPL License.