thealternativepress.com
robots.txt

Robots Exclusion Standard data for thealternativepress.com

Resource Scan

Scan Details

Site Domain thealternativepress.com
Base Domain thealternativepress.com
Scan Status Ok
Last Scan2024-11-06T17:28:18+00:00
Next Scan 2024-11-13T17:28:18+00:00

Last Scan

Scanned2024-11-06T17:28:18+00:00
URL http://thealternativepress.com/robots.txt
Redirect https://tapinto.net/robots.txt
Redirect Domain tapinto.net
Redirect Base tapinto.net
Domain IPs 35.173.112.78
Redirect IPs 35.173.112.78
Response IP 35.173.112.78
Found Yes
Hash 4c4abca0b7829935977f6cffb6de2cd8010fcdbdd9c2f53c3e26651147015750
SimHash 621468554751

Groups

*

Rule Path
Disallow /admin
Disallow /users/sign_in
Disallow /users
Disallow /transactions
Disallow /article.asp
Disallow /column.asp
Disallow /photos
Disallow /tracked_urls
Disallow /search*
Disallow /towns.json
Disallow /category.json
Disallow /*.php
Disallow /*.php$
Disallow /api/v1/masterhead*
Disallow /newsletters
Disallow /subscriptions
Disallow /towns/*/games/*

googlebot

Rule Path
Disallow /admin
Disallow /users/sign_in
Disallow /users
Disallow /transactions
Disallow /photos
Disallow /tracked_urls
Disallow /article.asp*
Disallow /column.asp*
Disallow /admin
Disallow /users/sign_in
Disallow /transactions
Disallow /article.asp
Disallow /column.asp
Disallow /photos
Disallow /tracked_urls
Disallow /search*
Disallow /newsletters
Disallow /subscriptions

googlebot-image

Rule Path
Allow /*
Disallow /admin
Disallow /users/sign_in
Disallow /users
Disallow /transactions
Disallow /article.asp*
Disallow /column.asp*
Disallow /search*

adsbot-google

Rule Path
Allow /*
Disallow /admin
Disallow /users/sign_in
Disallow /users
Disallow /transactions
Disallow /article.asp*
Disallow /column.asp*
Disallow /search*

linkedinbot/1.0

Rule Path
Allow /

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

Rule Path
Disallow /

ahrefssiteaudit

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.tapinto.net/sitemaps/sitemap-altpress-index.xml

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • Disallow: /
  • slow down Yahoo
  • slow down Bing