Betweenness Centrality

Original Paper(Cached)

A few review of the definition (also the same with the paper):

Let \(G = (V,E)\) be a graph that is undirectional, weighted, no multiple edges, connected.

Let \(V\) be the set of vertices, \(E\), the set of all edges

Let \(\omega\) be the weight function such that \(\omega(e) > 0 \text{ for all } e \in E\).

Let \(d_G(s,t)\) be the minimum distance from \(s\) to \(t\)(\(s,t \in V\)), \(d_G(s,s)=0\)

Let \(P_s(v) = \{ u \in V : \{u,v\} \in E,\, d_G(s,v) = d_G(s,u) + \omega(\{u,v\})\}\)

\(P_s(v)\) can be perceived as: if you go first from v to s, what point is best to choose in the first step.

It is clear that \(P_s(v) = \emptyset \iff v = s\)

Let \(\sigma_{st}\) be the number of shortest paths from \(s\) to \(t\). Define \(\sigma_{ss} = 1\)

Let \(\sigma_{st}(v)\) be the number of shortest paths from \(s\) to \(t\) containging \(v\).
Let \(\sigma_{st}(v,e)\) be the number of shortest paths from \(s\) to \(t\) containging \(v\) and \(e\).

\[\sigma_{st}(v) = \begin{cases} 0 & \text{ if } d_G(s,t) < d_G(s,v) + d_G(v,t) \\ \sigma_{sv} \cdot \sigma_{vt} & \text{otherwise} \end{cases}\]

So it is clear that \(\sigma_{st}(s) = \sigma_{st}(t) = \sigma_{st}\) and \(\sigma_{ss}(s) = 1\) and \(\sigma_{ss}(s') = 0 \text{ where } s'\neq s\)

Let \(\delta_{st}(v)\) be \(\frac{\sigma_{st}(v)}{\sigma_{st}}\).
Let \(\delta_{st}(v,e)\) be \(\frac{\sigma_{st}(v,e)}{\sigma_{st}}\).

So it is clear that \(\delta_{st}(s) = \delta_{st}(t) = 1\)

Common definition of Centrality

\[\begin{aligned} C_C(v) = \frac{1}{\sum_{t\in V}{d_G(v,t)}} & & \text { closeness centrality } \\ C_G(v) = \frac{1}{\max_{t\in V}{d_G(v,t)}} & & \text { graph centrality } \\ C_S(v) = \sum_{s,v,t \in V, s\neq v \neq t}{\sigma_{st}(v)} & & \text { stress centrality } \\ C_B(v) = \sum_{s,v,t \in V, s\neq v \neq t}{\frac{\sigma_{st}(v)}{\sigma_{st}}} & & \text { betweenness centrality } \\ \end{aligned}\]

Define the set of predecessors of a vertex v on shortest paths from s as

\(P_s(v) = \{u ∈ V : \{u, v\} ∈ E, d_G(s, v) = d_G (s, u) + ω(u, v)\}\).

Lemma 1: (Bellman criterion)

\[\sigma_{st}(v) > 0 \iff d_G(s,t) = d_G(s,v) + d_G(v,t)\]

Lemma 2: (Combinatorial shortest-path counting)

If \(s,v \in V,\, s \neq v\), then

\[\sigma_{sv} = \sum_{u\in P_s(v)}{\sigma_{su}}\]

Errors in the original paper.

In Page 6 the author define \(\delta_{s\bullet}(v)\) as:

\[\delta_{s\bullet}(v) = \sum_{t\in V}{ \delta_{st}(v) }\]
  • Lemma 5 If there is exactly one shortest path from \(s ∈ V\) to each \(t ∈ V\), the dependency of \(s\) on any \(v ∈ V\) obeys

    \[δs• (v) = \sum{w : v∈P_s(w)}(1 + δs•(w))\]
    • Counter example

      Suppose a graph with 2 vertices (\(s\) and \(t\)) and 1 edge connecting them.

      • By definition \(δs• (t) = δss(t) + δst(t) = 1\)
      • By Lemma \(δs• (t) = \sum_{w : t∈P_s(w)}(1 + δs•(w)) = \sum_{w:\emptyset}{\cdots} = 0\)

      It is empty set because there are no such \(w\) that \(t\in P_s(w)\)

  • In the deduction of page 8, it says:

    \[δs• (v) = \sum_{t∈V} δst (v) = \sum_{t∈V}{ \sum_{w : v∈P_s(w)}{ δst (v, {v, w})}} = \sum_{w : v∈P_s(w)} {\sum_{t∈V} {δst (v, \{v, w\})}}\]
    • Counter example:

      Suppose a graph with 2 vertices (\(s\) and \(t\)) and 1 edge connecting them.

      • By definition \(δs• (t) = δss(t) + δst(t) = 1\)
      • By this claim \(δs• (t) = \sum_{w : v∈P_s(w)}{\sum_{t∈V} {δst (v, \{v, w\})}} = \sum_{w : \emptyset}{\cdots} = 0\)
  • In page 8, it says:

    \[δst (v, \{v, w\}) = \begin{cases} \frac{σ_{sv}}{σ_{sw}} & \text{if } t = w \\ \frac{σ_{sv}}{σ_{sw}} \cdot \frac{σ_{st}(w)}{σ_{st}} & \text {if } t \neq w \end{cases}\]

    It is redundant because, if \(t = w\), then \(\frac{σ_{st}(w)}{σ_{st}} = 1\), it is just

    \[δst (v, \{v, w\}) = \frac{σ_{sv}}{σ_{sw}} \cdot \frac{σ_{st}(w)}{σ_{st}}\]
    • In page 8, it says:
    \[\begin{aligned} \sum_{w:v\in P_s(w)}{\sum_{t\in V} {\delta_{st}(v, \{v,w\})}} &= \sum_{w:v\in P_s(w)} \left( \frac{\sigma_{sv}}{\sigma_{sw}} + \color{red}{\sum_{t\in V\setminus\{w\}} \frac{σsv}{σsw} \cdot \frac{σst(w)}{σst}} \right) \\ &= \sum_{w:v\in P_s(w)}{\frac{σ_{sv}}{σ_{sw}}\cdot ( 1 + \color{red}{\delta_{s\bullet}(w)})} \end{aligned}\]

    The two parts in red color doesn’t follow the definition.

    • It is actually is:
    \[\begin{aligned} \sum_{w:v\in P_s(w)}{\sum_{t\in V} {\delta_{st}(v, \{v,w\})}} &= \sum_{w:v\in P_s(w)} \left( {\sum_{t\in V} \frac{σ_{sv}}{σ_{sw}} \cdot \frac{σ_{st}(w)}{σ_{st}}} \right) \\ &= \sum_{w:v\in P_s(w)}{\frac{σ_{sv}}{σ_{sw}}\cdot ( \delta_{s\bullet}(w))} \end{aligned}\]

Corrections:

Definition:

\[\delta_{s\bullet}(v) = \sum_{t\in V, t\neq s \neq v}{ \delta_{st}(v) }\]

We have:

\[\begin{aligned} \delta_{s\bullet}(v) &= \sum_{t\in V, t\neq s \neq v}{ \left( \delta_{st}(v) \right) } \\ &= \sum_{t\in V, t\neq s \neq v}{ \left( \frac{\sigma_{st}(v)}{\sigma_{st}} \right) } \\ &= \sum_{t\in V, t\neq s \neq v}{ \left( \frac{\sum_{w:v\in P_s(w)}{\sigma_{st}(v,\{v,w\})}}{\sigma_{st}} \right) } \\ \end{aligned}\]

Note in the parenthesis above that we have

\[\sigma_{st}(v, \{v,w\}) = \frac{\sigma_{sv}}{\sigma_{sw}} \cdot \sigma_{st}(w)\]

Why?.The definition of \(w\) means that \(v\) is closer to \(s\) than \(w\). and \(\sigma_{st}(v, \{v,w\}) = \sigma_{st}(w, \{v,w\})\). So the equation means that of all the paths from \(s\) to \(t\) via \(w\), which is \(\sigma_{st}(w)\), we take a fraction by only allowing one way (a.k.a \(\{v,w\}\)) of entering \(w\). We had \(\sigma_{sw}\) ways of entering \(w\) but now we have only \(\sigma_{sv}\). Hence the total possible ways is also fractured by \(\frac{\sigma_{sv}}{\sigma_{sw}}\).

Continue the equation:

\[\begin{aligned} \delta_{s\bullet}(v) &= \sum_{t\in V, t\neq s \neq v}{ \left( \delta_{st}(v) \right) } \\ &= \sum_{t\in V, t\neq s \neq v}{ \left( \frac{\sigma_{st}(v)}{\sigma_{st}} \right) } \\ &= \sum_{t\in V, t\neq s \neq v}{ \left( \frac{\sum_{w:v\in P_s(w)}{\sigma_{st}(v,\{v,w\})}}{\sigma_{st}} \right) } \\ &= \sum_{t\in V, t\neq s \neq v}{ \left( \frac{\sum_{w:v\in P_s(w)}{\frac{\sigma_{sv}}{\sigma_{sw}} \cdot \sigma_{st}(w)}}{\sigma_{st}} \right) } \\ &= \sum_{t\in V, t\neq s \neq v}{ \sum_{w:v\in P_s(w)} { \left( \frac{\sigma_{sv}}{\sigma_{sw}} \cdot \delta_{st}(w) \right) }} \\ &= \sum_{w:v\in P_s(w)} { \sum_{t\in V, t\neq s \neq v} { \left( \frac{\sigma_{sv}}{\sigma_{sw}} \cdot \delta_{st}(w) \right) }} \\ &= \sum_{w:v\in P_s(w)} \left( \frac{\sigma_{sv}}{\sigma_{sw}} \cdot { \sum_{t\in V, t\neq s \neq v} { \left( \delta_{st}(w) \right) }} \right) \\ &= \sum_{w:v\in P_s(w)} \left( \frac{\sigma_{sv}}{\sigma_{sw}} \cdot \left( { \left( \sum_{t\in V, t\neq s \neq w} { \delta_{st}(w) } \right) } + \delta_{sw}(w) - \delta_{sv}(w) \right) \right) \\ &= \sum_{w:v\in P_s(w)} \left( \frac{\sigma_{sv}}{\sigma_{sw}} \cdot ( \delta_{s\bullet}(w) + 1 ) \right) \end{aligned}\]