Backlinks
(by nicholas, last update june 2013)
title_words
Words appearing in the title of the backlink page
data_source
Provider this data is acquired from
Ahrefs
SeoMoz
Blekko (not implemented)
Faroo (not implemented)
source_url
Source of backlink
host_match_links_count
no idea
data
nested hash containing the backlink id again (duplicate information) and a mirrored_at timestamp
link_status
possible values:
live
missing Not sure what this represents
market_rank
Marketfu rank TODO: ask william for formula
destination_url
Destination url for the backlink
kind
Auto-associated tag
strategic
relevance
momentum
missing TODO: ask william for formula
source_host
source host
anchor_words
array of words that occur in the anchor text
TODO: deduplicate before saving
title
anchor-tag title attribute
domain_pr
page rank for the backlink domain
backlinks_count
backlinks count of the backlink page
locations
section the backlink is found in. Not sure if this data is still gathered.
in_body
in_footer
in_form
in_frame
in_header
in_javascript
in_list
in_nav
in_sidebar
page_title_array
words in the page title
alt
anchor-tag alt attribute
juice
Custom juice calculation for the page (inbound_links/outbound_links)
created_at
creation time in our system
links_count
total amount of link on the page
page_digest
hash of page body (cleansed)
follow
anchor-tag nofollow attribute. follow:true means that the nofollow attribute isn't set.
tags
Page classification
momentum
shopping
ecommerce
social
relevance
comments
match_type
match type
path
domain (not implemented) Not sure what the purpose is TODO: check with william
status
Active/Inactive
not sure what it means
path_match_links_count
links that match your path? not sure
domain_id
data belongs to this domain
destination digest
hash of destination body?
source_digest
hash of source page TODO: it's not a duplicate of page_digest so what's the difference?
destination host
destination host
code
status code for the source_url? TODO: clarify this
link_value
TODO: check with william for formula
image_link
boolean to indicate if this is an image link
anchor_text
contents of anchor tag from the backlink
ip_address
ip address of the source_host
updated_at
timestamps last updated in our system
destination_page_code
status code of the destination page
page_pr
page rank for the source page
page_title
page title for the source page
Last updated