Thanks for a great Workshop.

There is a Google Group for follow-up conversation. There are plans to hold Bay Area-centered meetups between now and the next conference. Join the list to participate or learn more.

Notes as they come in are being published on our blog.

We will be working with the community to plan Big Data #2 between August and October 2010.

Big Data Workshop is on April 23, 2010, 9am-5pm at the Computer History Museum in Mountain View, CA.

As the internet approaches 1 trillion connected gigabytes, we’re not in Kansas anymore. The data deluge poses important questions:

  • How will we manage all this information? Is the relational database doomed? How will we synchronize it? Will we all need to migrate to NoSQL stores? Or will the new play along nicely with 40 years of relational history?

  • Who will manage this information? Will we all have to own our own massive infrastructure, will we rent it, or just call the APIs of somebody else?

  • How will we analyze this information? Do we all need to learn Erlang and Map-Reduce, or will a new set of easy-to-use tools spring up, just like the spreadsheet came to the rescue a long time ago? Will programmers write code for distributed hash tables, relational databases, document stores, graph databases or all of the above?

  • How will we govern all that information? Who will keep it secure, and private, and auditable? Who determines what can and cannot be correlated? Who will watch the watchmen?

This event is for:

  • owners and managers of a large amount of data, including web, social media, health, pharmaceuticals, astronomy, government etc.
  • developers and users of Big Data technologies, including NoSQL databases, Map-Reduce algorithms, data mining, server farms etc.
  • stewards and guardians of Big Data, including legal and business professionals.

Together we can actually dive in and figure some of these challenges out. If you are interested in a conference where keynote speaker after keynote speaker is telling you how wonderful their company and products are, Big Data Workshop is not for you. Big Data Workshop is there for everybody to speak and listen, to discuss, to engage, to think and to re-think.

The agenda will be created live on the day of the event by attendees, facilitated by Kaliya Hamlin, who has designed and facilitated over 100 unconferences for professional technical communities.

Our model for the Big Data Workshop is the highly successful Internet Identity Workshop, a similarly structured unconference that has taken place twice a year for the past 5 years. It has successfully brought together all the movers and shakers concerned with identity on the internet; the workshop has played a crucial role in the development and adoption of widely deployed technologies such as OpenID, OAuth, Portable Contacts, Yadis/LRDD/Webfinger, and Activity Streams. The Big Data Workshop will follow the same format in order to facilitate a real discussion of the emerging problems and solutions by community and industry leaders.

  • By George Ludwig, April 2, 2010 @ 5:06 pm

    I’m very interested in NoSQL solutions that use distributed memory caches. It would be great if there was a session on serialization strategies, especially with regards to updating the data model and node rebalancing.

  • By Brett Sheppard, April 5, 2010 @ 11:45 am

    Looks like a good event. Unfortunately I’m out of town that day and won’t be able to attend, but could list the event at I can also set up an author blog profile for you or another member of the event if you’d like to post a pre-event blog and updates onsite on the day of the event.

    Brett Sheppard

  • By Richard Karpinski, April 10, 2010 @ 5:53 pm

    While four hours of driving plus registration fees make this too expensive for me, I hope you will record it and will share the recording, freely. Since the topic is of wide interest world wide, I’ll also recommend that you make a wiki, even before the event, to enable and encourage wide participation in pursuing the many topics which are sure to arise. Just talking about hoards of data makes me suggest collecting and sharing even more.

  • By Matthew Gonzales, April 14, 2010 @ 8:23 pm


    Thanks for putting this together. I’d like to be evaluated for the Databases directory as well as learn about any opportunities to get involved.

    Thank you
    Twitter: @Hypertable
