intresting info about the deep web



We are used to believe that what we see represents the ultimate truth. But, in many cases, what we see is only the tip of the iceberg, the bigger part being hidden or hard to discover. Such is the case with the Internet, or the Web. For many out there, the Internet’s main piece is represented by Google and other search engines, as these are the main tools to obtain information from various websites.
What we know about the Internet is what website or database owners allow us to know. This is known and referred to as the Surface Web and represents that part of the web that is indexed by search engines. What doesn’t get indexed because its owners want to keep the information private and discrete is called the Deep Web (other synonyms: Deepnet, Invisible Web, Undernet, Hidden Web) and it actually represents the bigger part of the iceberg. What we know and see is only its tip.
The Deep and Invisible Web
One of the most interesting definitions for the Invisible Web comes from Mike Bergman, founder of BrightPlanet:
searching on the Internet today can be compared to dragging a net across the surface of the ocean: a great deal may be caught in the net, but there is a wealth of information that is deep and therefore missed
The Deep Web is also consisted of websites that require registrations and login, limited or scripted content and even websites that don’t have any hyperlinks pointing to them. From my point of view, a big part of the Deep Web should be perceived as the “undiscovered web”. It is thought that the Deep Web is the place where people use anonymity to share viruses and take part in other illegal activities such as forbidden types of pornography.
Related Article: How to do Anonymous Web Surfing
But the Deep or Invisible Web can be browsed. Meet The Onion Router or Tor. It represents an American volunteer-run project based on free software that offers anonymous online communication. You’d be surprised to find out that many who use Tor are secret service “field agents”, “law enforcement officers”. Tor allows:
officials to surf questionable websites and services without leaving tell-tale tracks, but also “activists and whistleblowers”, for example “environmental groups [who] are increasingly falling under surveillance in the US under laws meant to protect against terrorism”. Tor, in short, is used both by the American state and by some of its fiercest opponents. On the hidden internet, political life can be as labyrinthine as in a novel by Thomas Pynchon.
Interesting stats about the Hidden Web
For some, the hidden web is a great occasion to be able to surf anonymously, if they were to use such a tool as Tor. But for others, it represents the perfect environment to launch virus attacks or to do other illegal activities. As we’ve seen, it is even used by secret agents in their activities, so that they won’t leave tracks. Here are some more interesting stats about it
The Deep Web is is thought to be almost 500 times bigger than the World Wide Web
The World Wide Web contains 19 terabytes of information whereas the Hidden Web has7,500 terabytes
The Invisible Web has almost 550 billion individual documents and the WWW has 1 billion
There are more than 200, 000 Deep Web websites
The Deep Web represents the largest growing category with new information on the Internet.
There is also quality content on the Deep Web and it is 1,000 to 2,000 times greater than the WWW
The Deep Web seems to be very relevant to the information, as it doesn’t follow by search engines’ rules
Even if it’s called the Invisible Web, 95% of it is public, free information
Under the Surface Web we don’t find only the activity of hackers, drug dealers, assassins, government agents, terrorists, perverts or FBI agents but there’s also valuable information coming from:
scientists
astronomers
physicists
sociologists
and even revolutionaries
The Hidden Web has made possible the appearance of WikiLeaks, for example. This article has some more interesting information about the Deep Web. Now you know – what you see is on the Internet is not everything, there’s much more to it but be careful what you’re doing. If you have dark thoughts on your mind, then you better leave them because, as anonymous as the Deep Web might not be, the government can still catch you. I suggest to go under the Surface of the Web to discover interesting work of scientists or to participate at “underground” discussions.


more info

      

homeaboutcontactThe Deep Web
The deep web is usually defined as the content on the Web not accessible through a search on general search engines. This content is sometimes also referred to as the hidden or invisible web.
The Web is a complex entity that contains information from a variety of source types and includes an evolving mix of different file types and media. It is much more than static, self-contained Web pages. In fact, the part of the Web that is not static, and is served dynamically "on the fly," is far larger than the static documents that many associate with the Web.
The concept of the deep Web is becoming more complex as search engines have found ways to integrate deep Web content into their central search function. This includes everything from airline flights to news to stock quotations to addresses to maps to activities on Facebook accounts. In the screenshot below, notice the various deep Web sources offered by Google, including images, maps, news, video, shopping, scholarly content, blogs, and so on. However, even a search engine as far-reaching as Google provides access to only a very small part of the deep Web.

Content on the deep Web
When we refer to the deep Web, we are usually talking about the following:
•The content of databases. Databases contain information stored in tables created by such programs as Access, Oracle, SQL Server, and MySQL. (There are other types of databases, but we will focus on database tables for the sake of simplicity.) Information stored in databases is accessible only by query. In other words, the database must somehow be searched and the data retrieved and then displayed on a Web page. This is distinct from static, self-contained Web pages, which can be accessed directly. A significant amount of valuable information on the Web is generated from databases.
•Non-text files such as multimedia, images, software, and documents in formats such as Portable Document Format (PDF) and Microsoft Word. For example, see Digital Image Resources on the Deep Web for a good indication of what is out there for images.
•Content available on sites protected by passwords or other restrictions. Some of this is fee-based content, such as subscription content paid for by libraries or private companies and available to their users based on various authentication schemes.
•Special content not presented as Web pages, such as full text articles and books
•Dynamically-changing, updated content, such as news and airline flights
This is usually the basic,"traditional" list. In these days of the social Web, let's consider adding new content to our list of deep Web sources. For example:
•Blog postings
•Comments
•Discussions and other communication activities on social networking sites, for example Facebook and Twitter
•Bookmarks and citations stored on social bookmarking sites
As you can see, based on these few examples, the deep Web is expanding.
Tips for dealing with deep Web content
•Vertical search can solve some of the problems with the deep Web. With vertical search, you can query a collection of data focused on a specific topic, industry, type of content, geographical location, language, file type, website, piece of data, and so on. For example, consider MedNar and PubMed to search for medical topics. On the social Web, there are search engines for blogs, RSS feeds, Twitter content, and so on.
Tip! See the tutorial on Vertical Search Engines for more information.
•Use a general search engine to locate a vertical search engine. For example, a Google search on "stock market search" will retrieve sites that allow you to search for current stock prices, market news, etc. This may be thought of as split level searching. For the first level, search for the database site. For the second level, go to the site and search the database itself for the information you want.
•A number of general search engines will search the deep Web for related content subsequent to an initial search. For example, try a search on Google for "World Trade Center" and select the Images tab. This will retrieve many pages of images of the World Trade Center. Look for this type of feature on other search engines.
•Try to figure out which kind of information might be stored in a database.. There is no general rule. But think about large listings of things with a common theme. A few examples of databased content include:
?phone books
?"people finders" such as lists of professionals such as doctors or lawyers
?patents
?laws
?dictionary definitions
?items for sale in a Web store or on Web-based auctions
?digital exhibits
?images and multimedia
?full text articles and books
•Information that is new and dynamically changing in content will appear on the deep Web. Look to the deep Web for late breaking items, such as:
?news
?job postings
?available airline flights, hotel rooms
?stock and bond prices, market averages
•The social Web often jumps on a late-breaking situation with news items and commentary. Blogs, Facebook, Twitter, and other social networking environments sometimes get out the word before more traditional sources.
•Topical coverage on the deep Web is extremely varied. This presents a challenge, since it is impossible to anticipate exactly what might turn up.

Reacties