Questions tagged [pagerank]

PageRank is a graph algorithm that assigns importance to nodes based on their links, and is named after its inventor - Larry Page. The algorithm is frequently applied to web graphs to calculate an importance of each node [url] in the graph.

PageRank is an algorithm to assign importance to nodes in linked data base, and is named after its inventor - Larry Page. The algorithm is frequently used on the web - to calculate an importance of each node [url] in the database.

The algorithm is simulating a random-surfer model. The random surfer starts from a random node in the graph, and can chose to use an out edge from this vertex at probability α, or to jump into a random node at probability 1-α. The score of each node is the probability of the random surfer to be at this node at some point in time.

The algorithm is patented, and IP rights belong to Stanford University.

350 questions
44
votes
4 answers

Getting Google PageRank via an API (PHP)

I have a list of domains and would like to get the: PageRank for all the domains. So just an integer, there must be an API that returns this. The number of results in Google The position of the word of the domainname. For example, "google.com"…
Henk Denneboom
  • 1,573
  • 3
  • 15
  • 18
41
votes
4 answers

Is there a way to programmatically access Google's search engine results?

Does google offer a way to programmatically see their search engine results for a certain query? I want to build a tracking application so that a user can see what rank on the google results their website is for certain keywords. EDIT: The behavior…
Doug
  • 661
  • 2
  • 7
  • 6
24
votes
1 answer

How do weighted edges affect PageRank in networkx?

I'm playing around with networkx (graph library in python) and I found documentation saying the PageRank algorithm takes edge weights into account when scoring, but I was wondering if larger edge weights were better or lower weights better?
Lostsoul
  • 25,013
  • 48
  • 144
  • 239
24
votes
5 answers

Keyword analysis in PHP

For a web application I'm building I need to analyze a website, retrieve and rank it's most important keywords and display those. Getting all words, their density and displaying those is relatively simple, but this gives very skewed results (e.g.…
Jeroen
  • 13,056
  • 4
  • 42
  • 63
22
votes
7 answers

Pagerank and its mathematics: Explanation needed

I am a student interested in developing a search engine that indexes pages from my country. I have been researching algorithms to use for sometime now and I have identified HITS and PageRank as the best out there. I have decided to go with PageRank…
Kennedy
  • 2,146
  • 6
  • 31
  • 44
21
votes
12 answers

Does a "blog" sub-domain help the pagerank of your main site?

I have my main application site https://drchrono.com, and I have a blog sub-domain under http://blog.drchrono.com. I was told by some bloggers that the blog sub-domain of your site helps the pagerank of your main site. Does traffic to your blog…
MikeN
  • 45,039
  • 49
  • 151
  • 227
16
votes
2 answers

How to make numpy array column sum up to 1

I am working on building a transition matrix for implementing the PageRank algorithm. How could I use numpy to make sure that the columns add up to one. For example: 1 1 1 1 1 1 1 1 1 should be normalized to be .33 .33 .33 .33 .33 .33 …
Simon
  • 591
  • 1
  • 7
  • 17
16
votes
1 answer

Networkx: Differences between pagerank, pagerank_numpy, and pagerank_scipy?

Does anyone know about the differences in accuracy between the three different pagerank functions in Networkx? I have a graph of 1000 nodes and 139732 edges, and the "plain" pagerank function didn't seem to work at all -- all but two of the nodes…
wrongusername
  • 18,564
  • 40
  • 130
  • 214
15
votes
2 answers

Wikidata results sorted by something similar to a PageRank

In Wikidata (Wikidata SPARQL endpoint), is there a way to order the SPARQL query results with something like a PageRank? SELECT DISTINCT ?entity ?entityLabel WHERE { ?entity wdt:P31 wd:Q5. SERVICE wikibase:label { bd:serviceParam…
jordipala
  • 193
  • 7
13
votes
1 answer

Using python's networkX to compute personalized page rank

I am trying to build a directed graph and compute personalized page rank over this graph. So suppose I have a graph with vertices {1,2,3,4} and edges going from 2, 3, and 4 to vertex 1, I would like to: (1) compute the personalized page rank of…
xiaolingxiao
  • 4,793
  • 5
  • 41
  • 88
11
votes
5 answers

Getting Good Google PageRank

In SEO people talk a lot about Google PageRank. It's kind of a catch 22 because until your site is actually big and you don't really need search engines as much, it's unlikely that big sites will link to you and increase your PageRank! I've been…
Tyler
  • 3,220
  • 1
  • 30
  • 44
11
votes
2 answers

Implementing PageRank using MapReduce

I'm trying to get my head around an issue with the theory of implementing the PageRank with MapReduce. I have the following simple scenario with three nodes: A B C. The adjacency matrix is here: A { B, C } B { A } The PageRank for B for example is…
Nick D.
  • 111
  • 1
  • 1
  • 4
11
votes
3 answers

How is pagerank calculated in a distributed way?

I understand the the idea behind pagerank and have implemented it(when reading the book "programming collective intelligence"). But I read it could be distributed across several servers(as I guess google is doing). I'm a bit confused because…
Lostsoul
  • 25,013
  • 48
  • 144
  • 239
9
votes
10 answers

How to evaluate a search engine?

I am a student carrying out a study to enhance a search engine's existing algorithm. I want to know how I can evaluate the search engine - which I have improved - to quantify how much the algorithm was improved. How should I go about comparing the…
ahmed
  • 14,316
  • 30
  • 94
  • 127
9
votes
1 answer

What is PageRanks Big-O complexity?

I'm searching for the Big-O complexity of PageRank algorithm. I hardly could found anything, I just found O(n+m) ( n - number of nodes, m - number of arcs/edges) but I didn't believe this complexity by now. I think it is missing the convergence…
Matthias Kricke
  • 4,931
  • 4
  • 29
  • 43
1
2 3
23 24