Archive for April, 2006

My head asplode

Really. All over my desk.
Critical deadlines + critical applications + developer’s day off = predictable meltdown of critical app.
WHEE.

Comments (1)

Today’s link roundup: Nielson’s UK search data, Scholarly Search from MS, whois.sc changes name, Orkut big in Brazil, certified email changes tack, Google using what to gauge quality?, Web 2.0: Underpants Gnomes business model

Nielsen’s UK Search Data
“It is important not to forget that Google’s phenomenal success has had implications and benefits for others far beyond Google itself. Take the university sector for example – Google drove over 5.6 million visitors to university websites in January alone, which represented 62% of the entire audience for that sector. This means that only the military sector (63%) owed a greater percentage of its entire audience to Google.”
Report
Article

New Scholarly Search from Microsoft
“Unlike Google Scholar, which crawls the web for academic content, Windows Live Academic Search works closely with publishers and uses structured feeds to build its index. As such, all content accessed through the service comes directly from a trusted source—namely, the publisher of a scholarly journal. The new service addresses two needs of the academic community that have traditionally been under-served, according to Danielle Tiedt, general manager of Windows Live Premium Search. Academic users want tools to help them fine tune search results, and are interested in getting more information on a search result before clicking off to specific article.”

Whois.sc Changes Name To DomainTools.com
“One of ResourceShelf’s favorite WHOIS domain name resources has changed it’s name and has a new look. Result pages are full of data. Also, all of the inexpensive ($15/month) but powerful services like historical WHOIS info (back to 2002) and the Mark Alert domain name tracking service available.”

Orkut explodes in Brazil
“About 11 million of Orkut’s more than 15 million users are registered as living in Brazil — a remarkable figure given that studies have estimated that only about 12 million Brazilians use the Internet from home.”

Certified email not intended to reduce spam after all
“Rather than helping to reduce spam Gingras claimed that the point is to allow users to verify who important messages are really from, like a message from your bank or credit card company. “To suggest that the introduction of CertifiedEmail is going to prevent spammers from sending spam or phishers from trying to phish — we have not said it, nor would any expert say it,” Gingras told DM News. The Goodmail CEO’s statement at the hearing, however, differs from recent remarks by AOL concerning the program’s benefits. “As we get ready to testify at the hearing … we are also working diligently to protect our members’ safety and security by preparing implementation of the anti-spam, anti-phishing CertifiedEmail program,” AOL spokesman Nicholas J. Graham said in a March 30 report on DMNews.com.”
Bill comments: I find this interesting as it indicates to me that perhaps trust is becoming a larger issue than spam. When a big fish like this changes stance so radically, they’ve smelled something in the wind. Or something is broken. Hehe.

Google using traffic analysis and user feedback to gauge quality?
‘I don’t think Google would want to base a ton of the overall relevancy algorithm on site popularity (and clearly they don’t since the top results are not always the most popuar sites), but they can and may use traffic patterns and searcher feedback to filter out junk sites. And it may help certain types of link spam stick out (ie: a site that just picked up 50,000 backlinks but few of them drive any traffic) may be a red flag for spam.”

Web 2.0: Underpants Gnomes Business Models
“Here’s the audio file if you missed the South Park episode that’s from, but basically it’s the following business model:
1. Collect underpants.
2. ????????
3. Profit!
It’s become way too clear now that a lot of new web companies are following this business model, and the majority of them will probably fail at some point. For example:
1. Share photos
2. ??????
3. Profit!”

Comments

Sneaky Geeky

Comments (1)

New study shows mathematicians need to get out a LOT more often

“(S+C) x (B+F)/T = V is the formula that describes the “ideal female ass” in shape, bounce, firmness and symmetry, according to psychology lecturer David Holmes of Manchester Metropolitan University in England:
S is the overall shape or droopiness of the bottom, C represents how spherical the buttocks are, B measures muscular wobble or bounce, while F records the firmness. V is the hip to waist ratio, or symmetry of the bottom, and T measures the skin texture and presence of cellulite.”

Y’know, I work with mathematicians. I guess they really can get away with claiming they are “thinking about equations” when I catch them staring at my booty

Comments (1)

Today’s link roundup: Adversarial Information Retrieval workshop, ACM SIGIR conference, Searcher Behavior Research Update, DNS cache poisoning and link hijacking are the new black!

AIRWeb (Adversarial Information Retrieval) Call for Papers
The workshop will be held on 10 or 11 August, during the ACM SIGIR Conference on Research and Development on Information Retrieval on 6-11 August in Seattle.
“The attraction of hundreds of millions of web searches per day provides significant incentive for many content providers to do whatever is necessary to rank highly in search engine results, while search engine providers want to provide the most accurate results. The conflicting goals of search and content providers is adversarial, and the use of techniques that push rankings higher than they belong is often called search engine spam. Such methods typically include textual as well as link-based techniques, or their combination. This, the second AIRWeb workshop, builds on last year’s successful meeting in Chiba, Japan as part of WWW2005. This year we solicit submissions on any aspect of adversarial information retrieval on the Web.”
Matt Cutts article
AIRWeb workshop site

29th Annual International ACM SIGIR - Conference on Research & Development on Information Retrieval
“SIGIR is the major international forum for the presentation of new research results and the demonstration of new systems and techniques in the broad field of information retrieval. The 29th Annual International ACM SIGIR Conference will be held at the University of Washington Campus in Seattle, WA, August 6-11, 2006.”

Searcher Behavior Research Update
“Two new studies examining how people search show that internet users are becoming more discriminating, with important implications for search marketers. As part of ongoing work conducted by Jupiter Research and sponsored by iProspect, “The iProspect Search Engine User Behavior Study” found that 62% of search engine users click on a search result within the first page of results, and a full 90% of users click on a result within the first three pages of search results. These figures were just 48% and 81% in 2002, based on similar research iProspect did at the time.”

DNS cache poisoning is the new black?
“Are you open for DNS Cache Poisoning? Are your SERPs being redirected because an attacker has compromised your DNS and is now stealing your traffic? And possibly destroying your domain in the process?”

Or maybe link hijacking is the new black…
“I was notified by a person in New Jersey (I’m in Washington state and my web site is hosted in Virginia) that when he clicked on my link in the search engine results sometimes he would be sent to my site, but other times he would be sent to an adult site. Sometimes the adult site would come up first and other times it would take several clicks on the same link, but the adult site would come up eventually.”

Comments (2)

Today’s link roundup: a week of Battelle links, new text search algorithm for Google. Google health?, Keyword volume data, Gary Flake, MSN search down, MSN master of the obvious?

John Battelle’s Friday Update
Link roundup from John Battelle.

Google gets new text search algorithm
“Orion finds pages where the content is about a topic strongly related to the key word. It then returns a section of the page, and lists other topics related to the key word so the user can pick the most relevant. The results of the query are displayed immediately in the form of expanded text extracts, giving the searcher the relevant information without having to go to the website - although there is still that option.”
SearchEngineWatch comments
Local news article
WebMaster World comments

Google Health, maybe
“Health has been an area of interest at Google for some time. We (including Adam) have been doing a variety of research in this area, including how to improve the quality of health-related search results.”

Keyword search volume data
“SEO Question: Are you aware of a tool or a service that can provide reliable search volume history for certain keywords? Like how many searches there were for “keyword phrase” in each month of 2005.”

A Frank Interview with Gary Flake
“The format is simple - I send an opening question to the luminary in question via email, they respond, and we go from there. First up is Gary Flake, a veteran of Overture, Yahoo and now Microsoft’s vaunted research labs (he’s founder and director of the new “Live Labs.”) Gary and I have known each other since I first began work on the book, and he’s always had a refreshingly frank outlook. I expected that to be tempered by a year at the world’s largest (and oft-criticized) software company, but I was wrong. If anything, Gary has become more outspoken. I’ve bolded the really juicy bits, but see for yourself….”

MSN Search down for four hours
“Microsoft spokesman said the outage began at about 8 a.m. Pacific time. At noon, the service was still down, but shortly afterward began working again.
The spokesman said company technicians have been working on the problem, but have not yet concluded what caused the shutdown in the first place.”

MSN, master of the obvious?
“Well, it turns out if you have clearly computer generated content with exactly the same number of links on each page, all pages the same size and over a million pages from 116,654 hosts that all share the same IP, they can detect you. Wow! That’s some pretty sophisticated spam detection technology they’ve got there.
In addition, this revolutionary new search technology can detect:
1. Corn in Nebraska
2. If a politician is lying”

Comments

Beyond sanity

You know sometimes you get so crazy and busy and everyone is crunching and crazy busy and you’re all so intense about the work and making everything happen and you all descend together into sleep deprivation and stress overload and everyone goes crazy in exactly the same way at exactly the same time?

This would explain the bigwig meeting the other day in which every error was explained away as, “oh you were on crack when you did that.” “Right, yes, of course.” It ended with me saying, (in my best sultry stewardess voice), “This meeting brought to you by CRACK, the San Francisco treat!”

And the email to a developer today in which I wrote,

One test to catch the bugs
One dev to fix them
One patch to bring them all and in the server mix them

I get paid to do this.* I LOVE MY JOB.

*Okay, the poetry was a freebie.

Comments

Job satisfaction

Comments

These boots were made for walkin’…. very slowly and carefully

Today I’m wearing my brand-new hooker boots (thanks eBay!). Five-inch heels and I’m still shorter than everyone else in the damn office.

Comments (6)

Today’s link roundup: a year of John Battelle, the last name test, Zeitgeist inaccurate?, Tim!, Google Related Links, April Fool’s on the web, and porn, porn, porn

John Battelle: a year of news in one post
“Here’s my first cut take on things which mattered in search, media, and technology over the past year. What have I missed?!”

The Last Name Test
“Turns out, for my name, there is arguably a more relevant result - a well known research and development company bearing the same name. It may not be more relevant for *me* - but web search, at present, is still a brute force application. The question is not whether it’s relevant for me alone (though it should be…), the question, at present, is whether it’s most relevant for the *most* people. Thanks to how Ask works, what I’ve learned is that for the query “Battelle,” more people find the institute relevant than my blog.”

Just How Accurate Is The Google Zeitgeist?
“Accuracy of Google Zeitgeist over at our Search Engine Watch Forums is a nice “what gives” about some oddities in the international version of the Google Zeitgeist, where it gives you a rundown on search behavior in various countries. For instance, why is viagra so hot in Singapore — and why do links from the Zeitgeist actually bring up Google South Africa? It got me to give the lists for each country a second look, and I was scratching my head as well. For example, “national lottery” is the top popular query for the United Kingdom in Feburary 2006? Really? Somehow, I doubt it.”

More Tim Berners-Lee than you can shake a router at
“Tim Berners-Lee, inventor of the Web, gave a stirring speech at the Oxford Internet Institute that makes subjects related to Internet freedom accessible to non-geeks and geeks alike. The audio is available as an MP3.”
Tim Berners-Lee has a blog

Google Related Links
“The service puts a little box on your page where Google analyzes the content to show related searches, news and web pages to your visitors. And for helping promote Google in this way, you get …. a little box to put on your page.”
Article
Site

Every single April Fool’s day internet joke
Truly the good, the bad, and the ugly.
The short list (less complete, more funny)
The full list (more complete, less funny)
April Fool’s in the world of search

Onanism 2.0
“In a long and thoughtful article on the internet pornography phenomenon, Adrian Turpin notes that the number of porn pages on the web reached 260 million in 2003, up from 14 million in 1998, and that online porn sales hit $2.5 billion in 2005, well over double the sales of music downloads.”

Comments