Wired
Facebook was stealth testing a new ‘Find Friends Nearby’ feature over the weekend that would show the Facebook users who were around you—think Find My Friends on iOS or Where You At? on Boost Mobile—but has now pulled the feature completely.
The service was admittedly half-baked, there was no official announcement about the Find Friends Nearby and when we tried playing around with it on our phones, we found nobody near us. Facebook told Wired that Find Friends Nearby wasn’t actually intended to launch. Specifically Facebook said:
“This wasn’t a formal release—this was just something that a few engineers were testing. With all tests, some get released as full products, others don’t. Nothing more to say on this for now, but we’ll communicate to everyone when there is something to say.”
Looks like we’ll have to wait a little bit longer to creep on Facebook users in our vicinity. [Wired]
This Is What Your Wikipedia Edits Look Like
Source: http://gizmodo.com/5495353/this-is-what-your-wikipedia-edits-look-like
Normally I’d file this image under our “what is this” image cache, but as you’ve already clocked, it’s somehow related to our Memory [Forever] theme. Those pretty colors are a visualization of the thousands of Wikipedia edits made by a bot.
It’s not just a one-off visualization for adding to our Tumblrs either. It’s the work of Many Eyes, a website set up by a pair of computer scientists at IBM, to catalog visual representations of data. Looking at the site now, two years after Wired brought it to light and interviewed founder Martin Wattenberg, recent artworks tackle the issue of migration in the US, and cremations.
When asked by Wired back then why he’s so keen to visualize data, Watterberg responded that:
“Language is one of the best data-compression mechanisms we have. The information contained in literature, or even email, encodes our identity as human beings. The entire literary canon may be smaller than what comes out of particle accelerators or models of the human brain, but the meaning coded into words can’t be measured in bytes. It’s deeply compressed. Twelve words from Voltaire can hold a lifetime of experience.”
Wikipedia data remains a favorite for them though, thanks to the “idea of completeness” Watterberg talks about, that even though all the data on Wikipedia equals a terabyte or so, “it’s huge in terms of encompassing human knowledge.” [Many Eyes via Wired]
Memory [Forever] is our week-long consideration of what it really means when our memories, encoded in bits, flow in a million directions, and might truly live forever.
How Google Crunches All That Data
Source: http://gizmodo.com/5495097/how-google-crunches-all-that-data
If data centers are the brains of an information company, then Google is one of the brainiest there is. Though always evolving, it is, fundamentally, in the business of knowing everything. Here are some of the ways it stays sharp.
For tackling massive amounts of data, the main weapon in Google’s arsenal is MapReduce, a system developed by the company itself. Whereas other frameworks require a thoroughly tagged and rigorously organized database, MapReduce breaks the process down into simple steps, allowing it to deal with any type of data, which it distributes across a legion of machines.
Looking at MapReduce in 2008, Wired imagined the task of determining word frequency in Google Books. As its name would suggest, the MapReduce magic comes from two main steps: mapping and reducing.
The first of these, the mapping, is where MapReduce is unique. A master computer evaluates the request and then divvies it up into smaller, more manageable “sub-problems,” which are assigned to other computers. These sub-problems, in turn, may be divided up even further, depending on the complexity of the data set. In our example, the entirety of Google Books would be split, say, by author (but more likely by the order in which they were scanned, or something like that) and distributed to the worker computers.
Then the data is saved. To maximize efficiency, it remains on the worker computers’ local hard drives, as opposed to being sent, the whole petabyte-scale mess of it, back to some central location. Then comes the second central step: reduction. Other worker machines are assigned specifically to the task of grabbing the data from the computers that crunched it and paring it down to a format suitable for solving the problem at hand. In the Google Books example, this second set of machines would reduce and compile the processed data into lists of individual words and the frequency with which they appeared across Google’s digital library.
The finished product of the MapReduce system is, as Wired says, a “data set about your data,” one that has been crafted specifically to answer the initial question. In this case, the new data set would let you query any word and see how often it appeared in Google Books.
MapReduce is one way in which Google manipulates its massive amounts of data, sorting and resorting it into different sets that reveal new meanings and have unique uses. But another Herculean task Google faces is dealing with data that’s not already on its machines. It’s one of the most daunting data sets of all: the internet.
Last month, Wired got a rare look at the “algorithm that rules the web,” and the gist of it is that there is no single, set algorithm. Rather, Google rules the internet by constantly refining its search technologies, charting new territories like social media and refining the ones in which users tread most often with personalized searches.
But of course it’s not just about matching the terms people search for to the web sites that contain them. Amit Singhal, a Google Search guru, explains, “you are not matching words; you are actually trying to match meaning.”
Words are a finite data set. And you don’t need an entire data center to store them—a dictionary does just fine. But meaning is perhaps the most profound data set humanity has ever produced, and it’s one we’re charged with managing every day. Our own mental MapReduce probes for intent and scans for context, informing how we respond to the world around us.
In a sense, Google’s memory may be better than any one individual’s, and complex frameworks like MapReduce ensure that it will only continue to outpace us in that respect. But in terms of the capacity to process meaning, in all of its nuance, any one person could outperform all the machines in the Googleplex. For now, anyway. [Wired, Wikipedia, and Wired]
Image credit CNET
Memory [Forever] is our week-long consideration of what it really means when our memories, encoded in bits, flow in a million directions, and might truly live forever.
Inside Google’s Secret Search Algorithm
Source: http://feeds.gawker.com/~r/gizmodo/full/~3/zzkIcilnJp4/inside-googles-secret-search-algorithm
Wired’s Steven Levy takes us inside the “algorithm that rules the web“—Google’s search algorithm, of course—and if you use Google, it’s kind of a must-read. PageRank? That’s so 1997.
It’s known that Google constantly updates the algorithm, with 550 improvements this year—to deliver smarter results and weed out the crap—but there are a few major updates in its history that have significantly altered Google’s search, distilled in a helpful chart in the Wired piece. For instance, in 2001, they completely rewrote the algorithm; in 2003, they added local connectivity analysis; in 2005, results got personal; and most recently, they’ve added in real-time search for Twitter and blog posts.
The sum of everything Google’s worked on—the quest to understand what you mean, not what you say—can be boiled down to this:
This is the hard-won realization from inside the Google search engine, culled from the data generated by billions of searches: a rock is a rock. It’s also a stone, and it could be a boulder. Spell it “rokc” and it’s still a rock. But put “little” in front of it and it’s the capital of Arkansas. Which is not an ark. Unless Noah is around. “The holy grail of search is to understand what the user wants,” Singhal says. “Then you are not matching words; you are actually trying to match meaning.”
Oh, and by the way, you’re a guinea pig every time you search for something, if you hadn’t guessed as much already. Google engineer Patrick Riley tells Levy, “On most Google queries, you’re actually in multiple control or experimental groups simultaneously.” It lets them constantly experiment on a smaller scale—even if they’re only conducting a particular experiment on .001 percent of queries, that’s a lot of data.
Be sure to check out the whole piece, it’s ridiculously fascinating, and borders on self-knowledge, given how much we all use Google (sorry, Bing). [Wired, Sweet graphic by Wired's Mauricio Alejo]
Additional Information on Real Time Bidding
![]()
Digital Consigliere
Tags
Popular Posts
- Netflix vs Blockbuster - Perfect example of an industry replaced by a more efficient version of itself
- Coke vs Pepsi vs Dr Pepper
- Marketing Costs Normalized to CPM Basis for Comparison
- The Top Endorsement Earners In Each Sport
- AOL's Plan To Steal TV Ad Dollars Is Totally Working
- 3G calling, no registration, and totally free
- drag2share: The Most Pinned Brand On Pinterest Doesn't Even Use A Pinterest Account [THE BRIEF]
- Groupon launches Breadcrumb iPad app, vows to not be a typical POS
- HP Mini 311 Nvidia ION Netbook Hackintosh'ed
Published Articles by Dr. Augustine Fou
- #SESNY: Toward a Performance Mindset for All Advertising
- Tips for Marketers Selecting a Digital Agency
- Context Is Not King or Queen; It's Just Necessary
- 2013 New Year's Digital Marketing Resolutions
- The Good, Bad, and Ugly of Online Campaign Ratings and eGRPs
- Why You Should Banish the Net Promoter Score Immediately
- Digital Strategy To-MAY-to vs. To-MAH-to
- The Agency-Client Relationship is Forever Changed
- Targeting vs. Privacy - Who Will Win?
- Digital + Traditional = Unified Marketing
Pages
Archives
- May 2013 (65)
- April 2013 (70)
- March 2013 (114)
- February 2013 (89)
- January 2013 (136)
- December 2012 (96)
- November 2012 (130)
- October 2012 (147)
- September 2012 (94)
- August 2012 (92)
- July 2012 (112)
- June 2012 (71)
- May 2012 (82)
- April 2012 (80)
- March 2012 (122)
- February 2012 (114)
- January 2012 (129)
- December 2011 (60)
- November 2011 (54)
- October 2011 (29)
- September 2011 (17)
- August 2011 (30)
- July 2011 (18)
- June 2011 (19)
- May 2011 (23)
- April 2011 (23)
- March 2011 (52)
- February 2011 (69)
- January 2011 (108)
- December 2010 (82)
- November 2010 (67)
- October 2010 (68)
- September 2010 (44)
- August 2010 (101)
- July 2010 (61)
- June 2010 (28)
- May 2010 (28)
- April 2010 (26)
- March 2010 (33)
- February 2010 (21)
- January 2010 (12)
- December 2009 (4)
- November 2009 (2)
- October 2009 (14)
- September 2009 (6)
- August 2009 (19)
- July 2009 (34)
- June 2009 (11)
- May 2009 (4)
- April 2009 (6)
- March 2009 (13)
- February 2009 (32)
- January 2009 (25)
- December 2008 (1)
- October 2008 (1)
- June 2008 (1)
- November 2007 (1)
Prototype Web Services
- drag2share – quickly share news items by drag and drop on email addresses
- LivePhotoFrame – upload and remotely manage a digital photo frame via unique URL
- MedleyTuner – create a continuous listening experience by uploading mp3s
- MusicSamplr – discover new artists and music, listen to samples
- SharedMost – what links on ANY webpage are shared most?
- Signatory – sign and date a document and verify it hasn't been altered since that exact time.
- WebTeleprompter – just what it says it is
Tim “I helped invent the Internet” Berners-Lee testified before a federal jury earlier this week, tearing into the validity of a key patent Eolas Technologies’ was exploiting to sue multiple web companies for $600 million. He must have been persuasive because the court took mere hours to reach its decision.

