Search Engine Indexing and PageRank

71 Terms

1

Search Engine

System that locates resources on the web.

2

Index

Record of resources on the World Wide Web.

3

PageRank Algorithm

Method for ranking web pages by importance, estimated from the links pointing to them.

4

Larry Page

Co-founder of Google and co-developer of the PageRank algorithm, which is named after him.

5

Sergey Brin

Co-founder of Google and web technologies innovator.

6

Web Crawler

Bot that discovers and records web pages.

7

Hyperlink

Link that connects one web page to another.

8

Publicly Available Web Pages

Web pages accessible to all internet users.

9

Search Engine Database

Storage for indexed web page information.

10

URL

Uniform Resource Locator; web page address.

11

Resource Quality

Credibility assessment of a web page.

12

Web Search

Querying the search engine's index for information.

13

Estimated Web Pages

Trillions of pages exist on the web.

14

Google's Index Size

Over 100 petabytes of indexed data.

15

Search Engine Examples

Google, Bing, Yahoo, Baidu.

16

Crawling Process

Following hyperlinks to discover new web pages.
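
As a rough illustration of this card, the sketch below crawls a small in-memory "web" instead of the real internet. The page names and the `TOY_WEB` link map are hypothetical, and a real crawler would also fetch pages over HTTP and respect site rules, which this sketch leaves out.

```python
from collections import deque

# Hypothetical toy web: each page maps to the pages it links to.
TOY_WEB = {
    "a.example": ["b.example", "c.example"],
    "b.example": ["c.example", "d.example"],
    "c.example": [],
    "d.example": ["a.example"],
}

def crawl(seed, web):
    """Discover pages breadth-first by following hyperlinks from a seed page."""
    discovered = [seed]
    seen = {seed}
    queue = deque([seed])
    while queue:
        page = queue.popleft()
        for target in web.get(page, []):
            if target not in seen:  # skip pages that were already recorded
                seen.add(target)
                discovered.append(target)
                queue.append(target)
    return discovered

print(crawl("a.example", TOY_WEB))
```

The `seen` set is what stops the crawler from looping forever when pages link back to each other.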

17

Last Updated

Timestamp indicating when a resource was modified.

18

Efficient Resource Location

Search engines streamline finding relevant information.

19

Information Retrieval

Process of obtaining information from the index.

20

Web Technologies

Methods and tools for web development and browsing.

21

Trillions of Pages

Approximate number of web pages on the internet.

22

Index

By analogy with a book's index: a lookup that records which pages contain which content.

23

Search Engine Indexing

Organizes web content for efficient retrieval.
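
One common way to organize content for retrieval is an inverted index, which maps each word to the pages that contain it. The sketch below assumes that approach; the page texts and the names `build_index` and `search` are made up for illustration and are not how any particular search engine works.

```python
from collections import defaultdict

def build_index(pages):
    """Map each lowercase word to the set of URLs whose text contains it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index

def search(index, term):
    """Return the URLs recorded for a single search term, sorted for stable output."""
    return sorted(index.get(term.lower(), set()))

# Hypothetical page contents.
pages = {
    "a.example": "search engines index the web",
    "b.example": "a web crawler follows links",
}
index = build_index(pages)
print(search(index, "web"))
```

Looking up a term is then a single dictionary access rather than a scan of every page.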

24

PageRank

Ranks web pages by relevance and usefulness.

25

Meta Tags

HTML tags describing web page content.

26

Web Crawlers

Automated programs that fetch web pages so they can be indexed.

27

Search Term

Keywords entered to retrieve search results.

28

Inbound Links

Links from other pages to a specific page.

29

Damping Factor

Probability that a user keeps following links instead of jumping to a random page, usually 0.85.

30

PageRank Algorithm

Calculates page importance based on links.

31

PR(A)

PageRank of page A in the algorithm.

32

C(Ti)

Number of outbound links on page Ti.

33

PR(Ti)

PageRank of page Ti, one of the pages linking to page A.

34

Search Results

List of pages returned by search engines.

35

Web Browser

Software used to access the internet.

36

Authoritative Source

Page with higher credibility and relevance.

37

Algorithm

Finite set of step-by-step rules for solving a problem, here calculating PageRank.

38

Relevance Calculation

Determines how search results are ranked.

39

Search Engine

Tool for finding information on the web.

40

Web Page

Document accessible via the internet.

41

Results Listing

Order in which search results are displayed.

42

Quality of Links

Influences the PageRank of a web page.

43

Activity Example

Testing search engine functionality with queries.

44

PageRank

Algorithm assessing webpage relevance in search results.

45

Indexing

Process of organizing web content for search engines.

46

Damping factor (d)

Probability that a user continues following links rather than jumping to a random page.

47

Initial PageRank

Starting assumption of PageRank value, often 1.

48

Iteration

Repetitive calculation to refine PageRank values.

49

Inbound link

Link from another page directing to this page.

50

Outbound link

Link from this page directing to another page.

51

Relevance factors

Criteria, beyond inbound links, that affect how a webpage is ranked in search results.

52

Domain name relevance

Importance of domain name to search query.

53

Keyword frequency

How often keywords appear on a webpage.
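
A minimal sketch of counting keyword frequency, assuming words are separated by whitespace and that matching ignores case. The function name `keyword_frequency` and the sample text are hypothetical.

```python
def keyword_frequency(text, keyword):
    """Count whole-word, case-insensitive occurrences of keyword in text."""
    words = text.lower().split()
    return words.count(keyword.lower())

sample = "Search engines rank pages and search results by relevance"
print(keyword_frequency(sample, "search"))
```

Punctuation attached to a word would prevent a match here, so real text would need tokenizing first.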

54

Page age

Duration since a webpage was created.

55

Content update frequency

How often the webpage content is refreshed.

56

Magnitude of updates

Size and significance of content changes.

57

H1 tags

HTML tags indicating main headings on a page.

58

PageRank formula

PR(A) = (1 - d) + d × (PR(T1)/C(T1) + ... + PR(Tn)/C(Tn)), where T1 to Tn are the pages linking to A.
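
The formula adds up PR(Ti)/C(Ti) for every page Ti that links to A, so a worked example may help. The sketch below assumes d = 0.85 and two made-up inbound pages: T1 with PageRank 1 and 2 outbound links, and T2 with PageRank 1 and 1 outbound link. The function name `pagerank_of` is hypothetical.

```python
def pagerank_of(inbound, d=0.85):
    """Apply PR(A) = (1 - d) + d * sum(PR(Ti) / C(Ti)).

    inbound is a list of (PR(Ti), C(Ti)) pairs, one per page linking to A.
    """
    return (1 - d) + d * sum(pr / c for pr, c in inbound)

# T1 contributes 1/2 and T2 contributes 1/1, so the sum is 1.5.
# PR(A) = 0.15 + 0.85 * 1.5 = 1.425
print(round(pagerank_of([(1.0, 2), (1.0, 1)]), 3))
```

A page with no inbound links gets the minimum value 1 - d, which is 0.15 under these assumptions.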

59

PageRank value

Numerical score indicating webpage importance.

60

Webpage A

Example page linked to Webpage B.

61

Webpage B

Example page linked to Webpage A and others.

62

Page C

Webpage with no outbound links.

63

Page D

Webpage linking back to Page A.

64

Final PageRank

Stable PageRank value after multiple iterations.

65

Search term frequency

How often a search term appears in queries.

66

Link reciprocity

Mutual linking between two webpages.

67

PageRank convergence

Point where PageRank values stabilize after iterations.
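
To make iteration and convergence concrete, here is a sketch that repeats the PageRank calculation until no value changes by more than a small tolerance. The link structure is an assumption loosely based on the example pages in this set (A links to B; B links to A, C and D; C has no outbound links; D links to A), and the tolerance and iteration cap are arbitrary choices.

```python
# Assumed example graph: each page maps to the pages it links to.
LINKS = {
    "A": ["B"],
    "B": ["A", "C", "D"],
    "C": [],
    "D": ["A"],
}

def pagerank(links, d=0.85, tol=1e-6, max_iter=200):
    """Iterate PR(A) = (1 - d) + d * sum(PR(Ti) / C(Ti)) until values stabilize."""
    ranks = {page: 1.0 for page in links}  # initial PageRank of 1 for every page
    for iteration in range(1, max_iter + 1):
        new_ranks = {}
        for page in links:
            # Share of PageRank passed on by each page that links to this one.
            shares = (ranks[src] / len(targets)
                      for src, targets in links.items() if page in targets)
            new_ranks[page] = (1 - d) + d * sum(shares)
        change = max(abs(new_ranks[p] - ranks[p]) for p in links)
        ranks = new_ranks
        if change < tol:  # converged: values no longer move noticeably
            break
    return ranks, iteration

ranks, iterations = pagerank(LINKS)
for page, value in sorted(ranks.items()):
    print(page, round(value, 3))
```

In this assumed graph C and D end up with the same value, because each receives only a one-third share from B.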

68

Search engine result ranking

Order of webpages displayed in search results.

69

Webpage relevance

Degree to which a webpage matches search intent.

70

PageRank calculation

Process of determining a page's relevance score.

71

Total PageRank sum

Combined PageRank values of all pages in scenario.
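
With the formula PR(A) = (1 - d) + d × (sum of PR(Ti)/C(Ti)) and a starting value of 1 per page, the total stays equal to the number of pages as long as every page has at least one outbound link. The sketch below checks that on a hypothetical three-page graph; the graph and the function name `step` are assumptions for illustration.

```python
# Hypothetical graph in which every page has at least one outbound link.
LINKS_3 = {
    "A": ["B", "C"],
    "B": ["C"],
    "C": ["A"],
}

def step(ranks, links, d=0.85):
    """Perform one PageRank iteration over all pages."""
    return {
        page: (1 - d) + d * sum(ranks[src] / len(targets)
                                for src, targets in links.items()
                                if page in targets)
        for page in links
    }

ranks_3 = {page: 1.0 for page in LINKS_3}
for _ in range(20):
    ranks_3 = step(ranks_3, LINKS_3)

total = sum(ranks_3.values())
print(round(total, 6))
```

If a page has no outbound links, its PageRank is not passed on and the total falls below the page count.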