thegearpage.net
robots.txt

Robots Exclusion Standard data for thegearpage.net

Resource Scan

Scan Details

Site Domain thegearpage.net
Base Domain thegearpage.net
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-09-14T15:16:22+00:00
Next Scan 2024-12-13T15:16:22+00:00

Last Successful Scan

Scanned 2023-08-20T08:52:50+00:00
URL https://thegearpage.net/robots.txt
Redirect https://www.thegearpage.net/robots.txt
Redirect Domain www.thegearpage.net
Redirect Base thegearpage.net
Domain IPs 104.21.56.32, 172.67.176.29, 2606:4700:3031::ac43:b01d, 2606:4700:3037::6815:3820
Redirect IPs 104.21.56.32, 172.67.176.29, 2606:4700:3031::ac43:b01d, 2606:4700:3037::6815:3820
Response IP 104.21.56.32
Found Yes
Hash f25f08ff797cdb7bb36d7d9f676bbe94be9107a0356dc1ad21cbeb0b93367f52
SimHash d3926377c734

Groups

*

Rule Path
Disallow /*?css.php*
Disallow /*?find-new%2F*
Disallow /*?reports%2F*
Disallow /*?account%2F*
Disallow /*?login%2F*
Disallow /*?trade%2F*
Disallow /*?register%2F*
Disallow /*?trending%2F*
Disallow /*?forumdisplay.php*
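
The Disallow paths above rely on the `*` wildcard, which matches any run of characters, while `?` is a literal character in robots.txt patterns. A minimal Python sketch of how such patterns could be tested against a request path follows. The helper names (`pattern_to_regex`, `is_disallowed`) and the sample paths are hypothetical, and the sketch compares strings literally without percent-encoding normalization.

```python
import re

# Disallow paths recorded for the "*" group in the scan.
DISALLOW_PATTERNS = [
    "/*?css.php*",
    "/*?find-new%2F*",
    "/*?reports%2F*",
    "/*?account%2F*",
    "/*?login%2F*",
    "/*?trade%2F*",
    "/*?register%2F*",
    "/*?trending%2F*",
    "/*?forumdisplay.php*",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")  # a trailing "$" pins the end of the path
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with "match anything".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path_and_query, patterns=DISALLOW_PATTERNS):
    """Return True if any Disallow pattern matches the path plus query string."""
    return any(pattern_to_regex(p).match(path_and_query) for p in patterns)


# Hypothetical request paths, used only for illustration.
blocked = is_disallowed("/index.php?account%2Fpreferences")
open_path = is_disallowed("/index.php?threads/some-topic.123/")
```

Because the `?` is literal, each rule only matches when the named segment appears directly after the question mark, so a path such as `/forumdisplay.php?f=1` would not be caught by the last rule in this sketch.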

Other Records

Field Value
crawl-delay 1
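
The crawl-delay record is a non-standard field that some crawlers read as the number of seconds to wait between requests. A small sketch of reading it with Python's standard `urllib.robotparser` is shown here. The two-line excerpt is a reconstruction from the scan data rather than the site's actual file, and `crawl_delay_for` and `examplebot` are hypothetical names.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt: the "*" group with its crawl-delay record.
ROBOTS_EXCERPT = [
    "User-agent: *",
    "Crawl-delay: 1",
]


def crawl_delay_for(robots_lines, user_agent):
    """Return the crawl delay in seconds for an agent, or 0 if none is set."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    delay = parser.crawl_delay(user_agent)
    return 0 if delay is None else delay


delay = crawl_delay_for(ROBOTS_EXCERPT, "examplebot")
```

A crawler that honors the field would call `time.sleep(delay)` between successive requests to the site.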

titan

Rule Path
Disallow

emailcollector

Rule Path
Disallow

emailsiphon

Rule Path
Disallow

emailwolf

Rule Path
Disallow

extractorpro

Rule Path
Disallow

webzip

Rule Path
Disallow

larbin

Rule Path
Disallow

b2w/0.1

Rule Path
Disallow

htdig/3.1.5

Rule Path
Disallow

teleport

Rule Path
Disallow

npbot

Rule Path
Disallow

turnitinbot

Rule Path
Disallow

dloader(naverrobot)

Rule Path
Disallow

dloader(speedy spider)

Rule Path
Disallow

funwebproducts

Rule Path
Disallow

webstripper

Rule Path
Disallow

websauger

Rule Path
Disallow

webcopier

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.thegearpage.net/board/sitemap.php
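
The groups from titan through webcopier are recorded with a Disallow rule whose path is empty. Assuming the scan reflects the file literally, an empty Disallow value under the Robots Exclusion Protocol places no restriction on the named agent, which is the opposite of `Disallow: /`. The sketch below uses Python's standard `urllib.robotparser` to show the difference. The two-line excerpts are reconstructions for illustration, and `allowed` is a hypothetical helper.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_lines, user_agent, url):
    """Parse the given robots.txt lines and report whether the agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(user_agent, url)


# Sample URL: the directory that holds the sitemap listed in the scan.
URL = "https://www.thegearpage.net/board/"

EMPTY_RULE = ["User-agent: titan", "Disallow:"]    # empty path, as recorded in the scan
SLASH_RULE = ["User-agent: titan", "Disallow: /"]  # blocks the whole site

empty_result = allowed(EMPTY_RULE, "titan", URL)   # agent is permitted
slash_result = allowed(SLASH_RULE, "titan", URL)   # agent is blocked
```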

Comments

  • vBulletin stuff
  • Allow all for Amazon Recommendation Ads
  • User-agent: Mozilla/5.0 (compatible;contxbot/1.0)
  • Disallow:
  • disallow nefarious 'bots and hope they comply