I've gotten used to the idea, however, at least for some kinds of applications. It also got me wondering: what would happen if you took a standard RDBMS like MySQL and denormalized your data? Would it scale easily? Would it perform well?
I pursued the idea and came up with OtoDB::MySQL, a Perl module that provides a simple interface for working with data spread across multiple MySQL servers. It forces you to structure your data using one-to-one relationships, and because of this, makes scaling (adding additional data servers) a straightforward proposition. As it currently works, OtoDB::MySQL simply reads and writes data incrementally across each data server in a given list, effectively striping data across n data units.
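To make the striping idea concrete, here is a minimal standalone sketch (not OtoDB itself; the unit names are made up) of round-robin placement: each successive write goes to the next server in the list, so placement is just "write number modulo server count".

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical unit names, standing in for real server entries.
my @units = ('srv1', 'srv2', 'srv3');
my $curr  = 0;

sub next_unit {
    my $unit = $units[$curr];
    $curr = ($curr + 1) % @units;   # advance, wrapping to the start
    return $unit;
}

# Ten writes stripe evenly: srv1, srv2, srv3, srv1, ...
my @placement = map { next_unit() } 1 .. 10;
print join(', ', @placement), "\n";
```

With three units, every unit ends up holding roughly a third of the records, which is the whole scaling story: add a unit, and new writes start landing on it.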
The "spec" can be downloaded here (pdf, 106k). Pretty much every thought I've had on the subject to date (correct or not) is in this document.
Here's the OtoDB 0.01 source (tar.gz).
use OtoDB::MySQL;

# Logic to iterate through units is the responsibility of the user.
# These structures are used for the round-robin approach in this
# example.
my $curr_unit = 0;
my @units = (
    # unit, connection string, username, password
    ['srv1','DBI:mysql:otodb:localhost','root',''],
    ['srv2','DBI:mysql:otodb:10.1.6.212','root','sandwich']
);

# Instantiate the object.
# next_unit() handles iteration over the array of servers above.
# In a 'real-world' example, you might delegate these duties to an
# external server, a global library, etc.
my $m = OtoDB::MySQL->new( units => \&next_unit );

# Create the data models.
# 'fields' are just MySQL column names/types
$m->create(
    model  => 'library',
    fields => {
        libid  => 'int not null',
        name   => 'varchar(30)',
        street => 'varchar(50)',
        city   => 'varchar(30)'
    }
);

# The 'oto_ref' column creates a link to the 'library' model.
# All references are required fields (i.e. 'not null')
$m->create(
    model  => 'book',
    fields => {
        title  => 'varchar(30)',
        author => 'varchar(30)'
    },
    oto_ref => 'library'
);

# Now, we add some data
# $key = [ entity, unit (server), unique ID of record ]
my $key = $m->add(
    entity => 'library',
    data   => {
        # genid() creates a universal 'user' ID
        libid  => $m->genid( @{$units[0]} ),
        name   => 'Library A',
        street => '131 Street',
        city   => 'City One'
    }
);

# Link to 'library' using $key from previous add()
$m->add(
    entity  => 'book',
    data    => { title => 'of mice and men', author => 'steinbeck' },
    oto_ref => $key
);

# Add books to a second library.
# Use the first server as the ID server. Using a single server makes
# sure that user data IDs will be unique across all units.
my $libid = $m->genid( @{$units[0]} );

# Combine add() calls to satisfy 'oto_ref'.
$m->add(
    entity  => 'book',
    data    => { title => 'of mice and men', author => 'steinbeck' },
    oto_ref => $m->add(
        entity => 'library',
        data   => {
            libid  => $libid,   # unique 'user' ID for library B
            name   => 'Library B',
            street => '111 Street',
            city   => 'City the Third'
        }
    )
);

$m->add(
    entity  => 'book',
    data    => { title => 'winter of our discontent', author => 'steinbeck' },
    oto_ref => $m->add(
        entity => 'library',
        data   => {
            libid  => $libid,   # unique 'user' ID for library B
            name   => 'Library B',
            street => '111 Street',
            city   => 'City the Third'
        }
    )
);

# Query data.
# Data is conceptually 'flat', or one-to-one. The 'entity'
# parameter is basically a hint. The '<' means that the entity
# to the right is a sub-entity of the entity to the left. Consider:
#   entity => 'library < book < item_detail'
# Because 'book' is a 'many' to 'library', save processing cycles
# by NOT reducing.
my @ents = $m->get(
    entity => 'library < book',
    fields => 'library.city, library.name, library.street, book.title',
    filter => 'library.name like ?',
    params => ['%two%'],
    order  => 'book.title asc',
    reduce => 'no'   # 'reduce' assumed if omitted
);

# @ents is an array of lightweight objects - OtoDB::MySQL::Entity
foreach my $ent (@ents) {
    print $ent->field('name') . "\t" . $ent->field('city') . "\n";
    print "data:\t" . join('..', $ent->row), "\n";
}

# Set 'reduce' to 'yes' to eliminate duplicates due to data being
# spread through multiple units. Generally, this will only be
# necessary for 'one' entities, i.e. entities that have
# duplicates.
@ents = $m->get(
    entity => 'library',
    fields => 'library.name, library.street',
    order  => 'library.name desc',
    reduce => 'yes'
);

foreach my $ent (@ents) {
    print $ent->field('name') . "\t" . $ent->field(1) . ":\n";
    print "\tdata:\t", join('..', $ent->row), "\n\n";
}

# 'reduce' is specifically turned off. You will see duplicates from
# each unit.
@ents = $m->get(
    entity => 'library',
    fields => 'library.name, library.street',
    order  => 'library.name asc',
    reduce => 'no'
);

foreach my $ent (@ents) {
    print $ent->field('name') . "\t" . $ent->field(1) . ":\n";
    print "\tdata:\t", join('..', $ent->row), "\n\n";
}

# Duplicating an entity.
# dup() is given a filter expression, and returns the first entity
# it finds in the data store matching that expression. Here we
# add another book based on our 'library ID'.
$m->add(
    entity  => 'book',
    data    => { title => 'grapes of wrath', author => 'steinbeck' },
    oto_ref => $m->dup(
        entity => 'library',
        filter => 'library.libid = ?',
        params => [$libid]   # remember this guy from above?
    )
);

# Update.
# 'fields' is a collection of the column names and the new
# values you want to replace.
$m->update(
    entity => 'library < book',
    fields => {
        # use fully qualified field names
        'library.name' => 'new lib a',
        'book.title'   => 'OFMICEANDMEN'
    },
    filter => 'library.name like ? and book.author like ?',
    params => ['%two%', 'steinbeck']
);

# Deleting.
# Delete based on a filter. Note: OtoDB does a check to see
# if there are any references from or to an entity, if you
# pass a single entity as a param. It dies if any exist.
$m->del(
    entity => 'library < book',
    filter => 'library.name like ?',
    params => ['%one%']
);

# Delete book matching title at a given library
$m->del(
    entity => 'library < book',
    filter => 'library.name like ? and book.title like ?',
    params => ['%lib%', '%winter%']
);

# Increment through units, round-robin. A required subroutine.
# A reference to this routine is passed to the $m object upon
# instantiation, and it's how OtoDB::MySQL gets the next unit for
# data operations.
sub next_unit {
    my $n = $curr_unit;                       # unit to use for this operation
    $curr_unit = ($curr_unit + 1) % @units;   # advance, wrapping to the start
    return $units[$n];
}
I find it hard to get a feel for how an idea really works until I've actually implemented something in it. So I made a rudimentary Wiki application called RubberWiki (included in the source above, see the rubwiki folder), based on OtoDB::MySQL. In my testing it was trivial to move from a single MySQL server to two (one config file edit), and query performance was good against both servers.
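The "one config file edit" claim is the whole point of the exercise, so here is a sketch of what it amounts to. The file name and exact format below are illustrative, not RubberWiki's actual config; the idea is simply that adding a data server means appending one entry to the unit list, with no schema or application-code changes.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical unit list, as it might appear in a config file.
# Going from two units to three is one added line ('srv3' is a
# made-up host for illustration).
my @units = (
    ['srv1', 'DBI:mysql:otodb:localhost',  'root', ''],
    ['srv2', 'DBI:mysql:otodb:10.1.6.212', 'root', 'sandwich'],
    ['srv3', 'DBI:mysql:otodb:10.1.6.213', 'root', 'sandwich'],  # new unit
);

print scalar(@units), " units configured\n";
```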
I think that querying -- OtoDB::MySQL->get() -- is where the bottleneck will appear as the number of data servers grows past a certain point. Some things I thought would improve this:
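To make the get() cost concrete, here is a rough standalone sketch (mock rows, no DBI, not OtoDB's actual implementation) of what a cross-unit query has to do: fetch matching rows from every unit, merge them, and then optionally 'reduce' duplicates of 'one' entities client-side. Both the fan-out and the merge grow linearly with the unit count.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Mock per-unit result sets; in practice each would come from a DBI
# query against one MySQL server. 'Library B' was written to two
# units (denormalized), so it appears twice before reduction.
my %unit_rows = (
    srv1 => [ ['Library A', '131 Street'], ['Library B', '111 Street'] ],
    srv2 => [ ['Library B', '111 Street'] ],
);

# Fan out to every unit and merge the results. This per-unit work is
# why get() is the likely bottleneck as units are added.
my @merged;
push @merged, @{ $unit_rows{$_} } for sort keys %unit_rows;

# 'reduce' step: drop rows that exist identically on multiple units.
my %seen;
my @reduced = grep { !$seen{ join "\x1f", @$_ }++ } @merged;

printf "merged: %d rows, reduced: %d rows\n",
    scalar @merged, scalar @reduced;
```

With reduce=>'no' you would return all three merged rows; with reduce=>'yes' the duplicate Library B row is eliminated, at the cost of hashing every row on the client.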
I'd like to hear your thoughts. Thanks.
In reply to RFC: OtoDB and rolling your own scalable datastore by arbingersys