How do you get your data?

Google: “The information is out there, we just have to crawl it and index it.”

Amazon: “The data lies in user behavior, we just have to apply the right machine-learning technique.”

Facebook: “If we make it fun, users will enter the data themselves.”

Twitter: “If we build an API, third parties will solve the problem and all the data will just flow through our system.”

How do you get your data?

(PS: People who don’t have an answer to this question yet: most of the social-recommendation apps, all the semantic web guys.)

  • http://www.amymossoff.com/ Amy Mossoff

    When writing internal apps, I found Facebook's model to be the most useful. Nobody had a problem finding data; people had a problem entering it. Think of busy salesmen.

  • http://twitter.com/cjerrells Christopher Sutton

    “People who don’t have an answer to this question yet: [...] all the semantic web guys”
    I suggest you check out the Linked Data movement, which primarily uses semantic web technologies to expose existing data sets as linked, machine-processable data:
    http://linkeddata.org/

    You can argue that this is bootstrapping the 'real' semantic web (where semantically marked-up data is published as standard), or that this is itself the most effective use of semantic web technology. Either way, I don't think you can claim the semantic web guys don't know where to get their data from.