guide
  • Introduction
  • Guiding Principles
    • Mission Statement
    • Conflict Resolution Process
  • Operating Model
    • Working Together
    • Holacracy
      • Meetings
      • Specific Roles
      • Terms and Definitions
      • Finer Points
      • Holacracy-Asana Key
    • Getting Things Done
      • Daily, Weekly, Monthly, and Annual Reviews
      • GTD-Asana Key
    • Transparency
    • Language
    • Budgeting
    • By Department
      • Engineering Operations
  • General Guidelines
  • Employment Policies
    • Equal Opportunity Employment
    • At-Will Employment
    • Code of Conduct in the Community
    • Complaint Policy
    • Drug and Alcohol Policy
    • Vacation, Holiday, and Paid Time Off (PTO) Policy
    • Supplemental Policies for Remote Employees and Contractors
    • Supplemental Policy for Bonus, Commissions, and other Performance-based Payments
    • Supplemental Policies for Hourly International Contractors or Workers
    • Supplemental Policies for Hourly International Contractors or Workers
    • Disputes and Arbitration
  • Benefits and Perks
    • Health Care
    • Vacation, Holiday and Paid Time Off (PTO) Policy
    • Holiday List
  • Hiring Documents
    • Acknowledgement of Receipt
    • Partner Proprietary Information and Inventions Agreement
  • Engineering Wiki
    • Code Snippets
      • Front End Code Snippets
    • Setup
      • 1: Overview of development using Audienti
      • 2: How to setup your dev environment on Docker
      • 2a: Setting up on our cloud your dev server
      • 3: Connect to Production using the VPN
      • 4: Import data into your development environment
    • Deployment
      • Docker based deployment of back end (manual)
    • Culture
      • How our development team works
      • Code Best Practices
    • Tips
      • Setting up a new development machine
      • Importing data to Development environment
      • GIT workflow and work tracking
      • Using Slack
      • Using Rubocop
      • Our Code Standards
      • General suggested best practices
      • Tracking your time
      • Naming Iterations
    • Migrations
      • Postgres
      • ElasticSearch
      • Redis
    • Database and System Maintenance
      • Redis Howtos
      • Elasticsearch HowTos
      • Postgres HowTos
      • Administration recipes
      • App maintenance crash course notes
    • Front End
      • 2016 Plan
      • Deploy
      • Assets
      • SearchLogic
      • How to create UI components
      • OMA Standard Tables
    • Monitoring and Alerting
      • Monitoring Systems
      • Monitoring individual controller actions
      • Get notified when a metric reaches a certain threshold
      • Instrumenting your models using Oma Stats
      • Configuring Graphite Charts
      • Tracking your results with StatsD
      • Logging Fields
      • Updating Kibana Filtering
    • Testing
      • Coverage
      • Elasticsearch mapping config synchronization
      • Testing Gotchas
      • Rspec Preloader
      • Test Best Practices
    • Models
      • Backlinks
    • Queueing and Worker System
      • Queueing and Job Overview
    • Processors
      • Rebuilding Spot Instances
      • Deploying processors
      • Running processors in development
      • Reverting to the previous build on a failed deployment
    • Processors / Opportunity Pipeline
      • Opportunity Pipeline
      • Diagram
    • Processors / Enrichment Pipeline
      • Diagram
      • Clustering
    • Processors / Backlink Pipeline
      • Diagram
      • Backlink Pipeline external APIs
      • Backlink pipeline logic
    • Processors / Automation Pipeline
      • Diagram
      • Automation Pipeline Overview
      • Agents
      • Running in development
    • Messaging and Social Accounts
      • Overview
    • API
      • Audienti API
    • Algorithms
    • Troubleshooting
      • Elasticsearch
    • Big Data Pipeline Stuff
      • Spark
    • Our Product
      • Feature synopsis of our product
    • Research
      • Backend framework comparison
      • Internet marketing Saas companies
    • Code snippets
      • Commonly Used
      • Not Used
    • Miscellaneous
      • Proxies and Bax
    • Legacy & Deprecated
      • Search criteria component
      • Classes list
      • Target Timeline
      • Twitter processor
      • Asset compilation
      • Test related information
      • Interface to EMR Hadoop jobs
      • Mongo Dex Indexes to be Built
      • Mongodb errors
      • Opportunity pipeline scoring
      • Graph Page
      • Lead scoring
      • Insights
      • Shard keys
      • Setting up OMA on local
      • Clone project to local machine
      • Getting around our servers in AWS
  • Acknowledgements
  • Documents That Receiving Your First Payment Triggers Acknowledgement and Acceptanace
Powered by GitBook
On this page
  • title_words
  • data_source
  • source_url
  • host_match_links_count
  • data
  • link_status
  • market_rank
  • destination_url
  • kind
  • source_host
  • anchor_words
  • title
  • domain_pr
  • backlinks_count
  • locations
  • page_title_array
  • alt
  • juice
  • created_at
  • links_count
  • page_digest
  • follow
  • tags
  • match_type
  • status
  • path_match_links_count
  • domain_id
  • destination digest
  • source_digest
  • destination host
  • code
  • link_value
  • image_link
  • anchor_text
  • ip_address
  • updated_at
  • destination_page_code
  • page_pr
  • page_title
  1. Engineering Wiki
  2. Models

Backlinks

(by nicholas, last update june 2013)

title_words

Words appearing in the title of the backlink page

data_source

Provider this data is acquired from

  • Ahrefs

  • SeoMoz

  • Blekko (not implemented)

  • Faroo (not implemented)

source_url

Source of backlink

host_match_links_count

no idea

data

nested hash containing the backlink id again (duplicate information) and a mirrored_at timestamp

link_status

possible values:

  • live

  • missing Not sure what this represents

market_rank

Marketfu rank TODO: ask william for formula

destination_url

Destination url for the backlink

kind

Auto-associated tag

  • strategic

  • relevance

  • momentum

  • missing TODO: ask william for formula

source_host

source host

anchor_words

array of words that occur in the anchor text

TODO: deduplicate before saving

title

anchor-tag title attribute

domain_pr

page rank for the backlink domain

backlinks_count

backlinks count of the backlink page

locations

section the backlink is found in. Not sure if this data is still gathered.

  • in_body

  • in_footer

  • in_form

  • in_frame

  • in_header

  • in_javascript

  • in_list

  • in_nav

  • in_sidebar

page_title_array

words in the page title

alt

anchor-tag alt attribute

juice

Custom juice calculation for the page (inbound_links/outbound_links)

created_at

creation time in our system

links_count

total amount of link on the page

page_digest

hash of page body (cleansed)

follow

anchor-tag nofollow attribute. follow:true means that the nofollow attribute isn't set.

tags

Page classification

  • momentum

  • shopping

  • ecommerce

  • social

  • relevance

  • comments

match_type

match type

  • path

  • domain (not implemented) Not sure what the purpose is TODO: check with william

status

Active/Inactive

not sure what it means

path_match_links_count

links that match your path? not sure

domain_id

data belongs to this domain

destination digest

hash of destination body?

source_digest

hash of source page TODO: it's not a duplicate of page_digest so what's the difference?

destination host

destination host

code

status code for the source_url? TODO: clarify this

link_value

TODO: check with william for formula

image_link

boolean to indicate if this is an image link

anchor_text

contents of anchor tag from the backlink

ip_address

ip address of the source_host

updated_at

timestamps last updated in our system

destination_page_code

status code of the destination page

page_pr

page rank for the source page

page_title

page title for the source page

PreviousModelsNextQueueing and Worker System

Last updated 7 years ago