Monday, July 28, 2008

Google URL Index Hits 1 Trillion


We knew the web was big...

7/25/2008 10:12:00 AM
We've known it for a long time: the web is big. The first Google index in 1998 already had 26 million pages, and by 2000 the Google index reached the one billion mark. Over the last eight years, we've seen a lot of big numbers about how much content is really out there. Recently, even our search engineers stopped in awe about just how big the web is these days -- when our systems that process links on the web to find new content hit a milestone: 1 trillion (as in 1,000,000,000,000) unique URLs on the web at once!

How do we find all those pages? We start at a set of well-connected initial pages and follow each of their links to new pages. Then we follow the links on those new pages to even more pages and so on, until we have a huge list of links. In fact, we found even more than 1 trillion individual links, but not all of them lead to unique web pages. Many pages have multiple URLs with exactly the same content or URLs that are auto-generated copies of each other. Even after removing those exact duplicates, we saw a trillion unique URLs, and the number of individual web pages out there is growing by several billion pages per day.

Read Complete Article

Sunday, July 27, 2008

Colorful!


Photo By Pixdaus

PDF Miner

What's It?

PDFMiner is a suite of programs that aims to help analyzing text data from PDF documents. It includes a PDF parser, a PDF renderer (though only rendering text is supported for now), and a couple of nice tools to extract texts. Unlike other PDF-related tools, it allows to obtain the exact location of texts in a page, as well as other layout information such as font size or font name, which could be useful for analyzing the document.

Features:

  • Written entirely in Python. (for version 2.5 or newer)
  • Supports up to PDF-1.7 specification.
  • Supports Non-ASCII languages and vertical writing scripts.
  • Supports Various font types (Type1, TrueType, Type3, and CID).
  • Supports Basic encryption (RC4).
  • Supports PDF to HTML conversion.
  • Supports Outline (TOC) extraction.
  • Supports Tagged contents extraction.

"Kuku Kloc" Online Alarm Clock

Kuku Kloc

Wednesday, July 23, 2008

Print Free Graph Paper


Save yourself money and a trip to the store! Print graph paper free from your computer. This site is perfect for science and math homework, craft projects and other graph paper needs. All graph paper files are optimized PDF documents requiring Adobe Reader for viewing.

Take advantage of your printing flexibility; print on transparency film for sharp graph paper overheads, or waterproof paper for field data-collecting.

Tuesday, July 22, 2008

GigaPan


GigaPan is the newest development of the Global Connection Project, which aims to help us meet our neighbors across the globe, and learn about our planet itself. GigaPan will help bring distant communities and peoples together through images that have so much detail that they are, themselves, the objects of exploration, discovery and wonder. We believe that enabling people to explore, experience, and share each other's worlds can be a transforming experience. Our mission is to make all aspects of the GigaPan experience accessible and affordable to the broadest possible community.

GigaPan consists of three technological developments: a robotic camera mount for capturing very high-resolution (gigapixel and up) panoramic images using a standard digital camera; custom software for constructing very high-resolution gigapixel panoramas; and, a new type of website for exploring, sharing and commenting on gigapixel panoramas and the detail our users will discover within them. The GigaPan website allows hosting and sharing all kinds of panoramas, and so the robotic GigaPan mount is recommended but is certainly not required to be part of this community.

Planet eBook


Welcome to Planet eBook, the home of free classic literature. We offer an assortment of classic novels and books in electronic form which you are free give to your friends, classmates, students, anyone!

Existing free eBooks on the Web tend to be well beneath the quality of paper books, making them more difficult and less pleasurable to read. At Planet eBook we're trying to change this. Our goal is to publish a small selection of high-quality eBooks — each a genuine alternative for readers wanting to enjoy reading a book without having to pay for it. The books we publish are all in the public domain so there is no real need for readers to continue to pay for them.

You're welcome to print them out for classes and courses, distribute them on CD/DVDs, offer them for download from your website, and so on — really, you can share them however you like, as long as you don't charge money for them.

Sunday, July 20, 2008

'Dark Knight' sets opening weekend box office record


A Warner Bros. executive says the Batman sequel "The Dark Knight" has taken in $155.34 million to top "Spider-Man 3" for best opening weekend ever at the box office.

The figures released Sunday show "The Dark Knight" more than $4 million ahead of the $151.1 million first weekend for "Spider-Man 3" in May 2007.

Studio distribution chief Dan Fellman says "The Dark Knight" also broke the "Spider-Man 3" record for best debut in IMAX large-screen theaters with $6.2 million. "Spider-Man 3" opened with $4.7 million in IMAX cinemas.

Stoked by fan fever over the manic performance of the late Heath Ledger as the Joker, "The Dark Knight" also set a one-day box office record with $66.4 million on opening day, Fellman said Saturday.

The movie's Friday haul surpassed the previous record of $59.8 million set last year by "Spider-Man 3."

Via CNN

WebMail Notifier For Firefox



WebMail Notifier checks your webmail accounts and notifies the number of unread emails...
Supports : gmail, yahoo, hotmail, daum, naver, empas, nate and more...

Download link