13 terms

Option C: Web Science 2



Search Engine
Software that finds information on the internet or World Wide Web
Web Crawler
An automated program that surfs the web and indexes and/or copies websites
Meta tag
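An HTML element, placed in the document's head, that carries extra information about the page (e.g. description, keywords) rather than visible content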
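A minimal sketch of how a crawler might read meta tags, using Python's standard `html.parser` module. The names `MetaTagParser`, `extract_meta`, and the `SAMPLE` page are made up for illustration.

```python
from html.parser import HTMLParser


class MetaTagParser(HTMLParser):
    """Collects name/content pairs from <meta> tags."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        name = attr.get("name")
        content = attr.get("content")
        # Skip meta tags without both a name and a content value.
        if name and content is not None:
            self.meta[name.lower()] = content


def extract_meta(html):
    """Return a dict of meta tag names to their content strings."""
    parser = MetaTagParser()
    parser.feed(html)
    return parser.meta


# Hypothetical sample page, for illustration only.
SAMPLE = """<html><head>
<meta name="description" content="Notes on web science">
<meta name="keywords" content="crawler, index, search">
</head><body>Hello</body></html>"""

print(extract_meta(SAMPLE))
```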
Parallel Web Crawling
Speeding up web crawling by running crawlers on multiple computers at the same time.
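A rough sketch of the idea, under two loud assumptions: worker threads on one machine stand in for the multiple computers, and a hypothetical in-memory `FAKE_WEB` dictionary stands in for real network fetches. `fetch` and `parallel_crawl` are illustrative names.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory "web": URL -> (page text, outgoing links).
FAKE_WEB = {
    "a.example": ("home page", ["b.example", "c.example"]),
    "b.example": ("about page", ["c.example"]),
    "c.example": ("contact page", ["a.example"]),
}


def fetch(url):
    """Stand-in for a network fetch; returns (url, text, links)."""
    text, links = FAKE_WEB.get(url, ("", []))
    return url, text, links


def parallel_crawl(seed, workers=4):
    """Crawl outward from seed, fetching each frontier in parallel."""
    seen = {seed}
    frontier = [seed]
    pages = {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        while frontier:
            # All URLs in the current frontier are fetched concurrently.
            results = list(pool.map(fetch, frontier))
            frontier = []
            for url, text, links in results:
                pages[url] = text
                for link in links:
                    if link not in seen:
                        seen.add(link)
                        frontier.append(link)
    return pages


print(sorted(parallel_crawl("a.example")))
```

Tracking the `seen` set in one place keeps two workers from crawling the same page twice. A real multi-machine crawler would need some shared or partitioned equivalent.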
PageRank Algorithm
1. Crawl the internet
2. Save the content and index the content's popular words
3. Identify the links on the page
4. Each link to an already indexed page increases the rank of that linked page
5. Index search results
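Step 4 above can be sketched as a plain inbound-link count. This is a simplification that follows the card literally: the full PageRank algorithm also weights each link by the rank of the page it comes from. The `LINKS` graph and the function names are hypothetical.

```python
from collections import Counter


def inbound_link_rank(link_graph):
    """Count inbound links per indexed page (step 4, taken literally)."""
    rank = Counter({url: 0 for url in link_graph})
    for source, targets in link_graph.items():
        for target in set(targets):  # count each linking page at most once
            # Only links to already indexed pages count; self-links are ignored.
            if target in link_graph and target != source:
                rank[target] += 1
    return rank


def order_by_rank(link_graph):
    """Return pages sorted by descending rank, ties broken by URL."""
    rank = inbound_link_rank(link_graph)
    return sorted(link_graph, key=lambda url: (-rank[url], url))


# Hypothetical link graph: page -> pages it links to.
LINKS = {
    "a.example": ["b.example", "c.example"],
    "b.example": ["c.example"],
    "c.example": ["a.example"],
}

print(order_by_rank(LINKS))
```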
Basic Search Engine Algorithm
1. Crawl the internet
2. Save meta keywords
3. Save content and popular words
4. Match keywords with content words
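Steps 2 to 4 above might look like the following sketch. It assumes the meta keywords arrive as one comma-separated string and that "popular words" means the most frequent words on the page. `popular_words` and `keyword_matches` are illustrative names.

```python
import re
from collections import Counter


def popular_words(text, top_n=5):
    """Return the top_n most frequent words in the page text."""
    words = re.findall(r"[a-z]+", text.lower())
    return [word for word, _ in Counter(words).most_common(top_n)]


def keyword_matches(meta_keywords, text, top_n=5):
    """Return the meta keywords that also appear among the popular words."""
    popular = set(popular_words(text, top_n))
    keywords = {k.strip().lower() for k in meta_keywords.split(",")}
    return sorted(k for k in keywords if k in popular)


# Hypothetical page content and meta keywords.
TEXT = "search engines crawl pages; search engines index pages; search works"
print(keyword_matches("search, pages, cooking", TEXT, top_n=3))
```

A keyword that never shows up in the page content (here "cooking") produces no match, which is one simple way to discount misleading meta keywords.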
Shallow web
Websites and content easily visible to search engines.
Deep web
Websites and content that are dynamic and/or unlinked (e.g. private websites) and so are not easily visible to search engines.
Search Engine Optimization
The process of affecting the visibility of a website or a web page in a search engine's organic search results.
Ubiquitous Computing
Concept where computing is made to appear everywhere and anywhere
Grid Computing
The collection of computer resources from multiple locations to reach a common goal. The grid typically consists of different kinds of computers working on one large task (e.g. protein folding).
Interoperability
The ability of systems and organizations to work together
Cloud Computing
Involves distributed computing over a network where a program or application may run on many connected computers at the same time
E.g. Google Drive, Dropbox, Amazon Web Services