Kevin Kelly -- The Technium
Popularity Report
![]() |
|||
![]() |
|||
![]() |
|||
![]() |
|||
![]() |
|||
![]() |
URL Tag Cloud
Bookmark History
Saved by 23 people (-3 private), first by anonymouse user on 2008-06-29
- Naoyamakino on 2009-10-17 - Tags google
- Web-evolution on 2009-10-16 - Tags no_tag
- Amiigo on 2008-09-17 - Tags science , data , analysis , blog
- Ognjen on 2008-08-11 - Tags science , technology , philosophy , kkelly , longtail , searle
- Ezrasf on 2008-07-22 - Tags science , google , technium
Public Sticky notes
Highlighted by rakerman
Highlighted by hnouwens
Highlighted by web-evolution
Highlighted by jangondol
It may turn out that tremendously large volumes of data are sufficient to skip the theory part in order to make a predicted observation. Google was one of the first to notice this. For instance, take Google's spell checker. When you misspell a word when googling, Google suggests the proper spelling. How does it know this? How does it predict the correctly spelled word? It is not because it has a theory of good spelling, or has mastered spelling rules. In fact Google knows nothing about spelling rules at all.
Instead Google operates a very large dataset of observations which show that for any given spelling of a word, x number of people say "yes" when asked if they meant to spell word "y." Google's spelling engine consists entirely of these datapoints, rather than any notion of what correct English spelling is. That is why the same system can correct spelling in any language.
Highlighted by naoyamakino
Highlighted by web-evolution
Google knows nothing about spelling rules at all.
Instead Google operates a very large dataset of observations which show that for any given spelling of a word, x number of people say "yes" when asked if they meant to spell word "y."
Highlighted by jangondol
Highlighted by jangondol
Highlighted by web-evolution
Highlighted by jangondol
Highlighted by jangondol
Petabytes allow us to say: "Correlation is enough." We can stop looking for models. We can analyze the data without hypotheses about what it might show. We can throw the numbers into the biggest computing clusters the world has ever seen and let statistical algorithms find patterns where science cannot.
Highlighted by jangondol
Petabytes allow us to say: "Correlation is enough." We can stop looking for models. We can analyze the data without hypotheses about what it might show. We can throw the numbers into the biggest computing clusters the world has ever seen and let statistical algorithms find patterns where science cannot.
Highlighted by web-evolution
There may be something to this observation. Many sciences such as astronomy, physics, genomics, linguistics, and geology are generating extremely huge datasets and constant streams of data in the petabyte level today. They'll be in the exabyte level in a decade. Using old fashioned "machine learning," computers can extract patterns in this ocean of data that no human could ever possibly detect. These patterns are correlations. They may or may not be causative, but we can learn new things. Therefore they accomplish what science does, although not in the traditional manner.
Highlighted by takuya514
Highlighted by jangondol
Highlighted by takuya514
Highlighted by takuya514
Highlighted by web-evolution
Highlighted by jangondol
Highlighted by takuya514
Highlighted by jangondol
For a long time we were stuck on the idea that the brain somehow contained a "model" of reality, and that AI would be achieved by constructing similar "models." What's a model? There are 2 requirements: 1) Something that works, and 2) Something we understand. Our large, distributed, petabyte-scale creations, whether GenBank or Google, are starting to grasp reality in ways that work just fine but that we don't necessarily understand.
Highlighted by takuya514
Highlighted by takuya514
Highlighted by web-evolution
Highlighted by bibliothecaire
Perhaps understanding and answers are overrated. "The problem with computers," Pablo Picasso is rumored to have said, "is that they only give you answers." These huge data-driven correlative systems will give us lots of answers -- good answers -- but that is all they will give us. That's what the OneComputer does -- gives us good answers. In the coming world of cloud computing perfectly good answers will become a commodity. The real value of the rest of science then becomes asking good questions.
Highlighted by takuya514
Highlighted by jangondol


Public Comment