waymarking.com
robots.txt

Robots Exclusion Standard data for waymarking.com

Resource Scan

Scan Details

Site Domain waymarking.com
Base Domain waymarking.com
Scan Status Ok
Last Scan2024-11-16T09:22:27+00:00
Next Scan 2024-11-23T09:22:27+00:00

Last Scan

Scanned2024-11-16T09:22:27+00:00
URL https://waymarking.com/robots.txt
Domain IPs 63.251.163.208
Response IP 63.251.163.208
Found Yes
Hash 42517cb86ddb880e323479c21c3dd4fed1bb6f35536ac6844aae610ce5113be2
SimHash 2a11d76287b5

Groups

dotbot

Rule Path
Disallow /

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

twiceler

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

scoutjet

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

*

Rule Path
Disallow /admin/
Disallow /users/
Disallow /login/
Disallow /hunt/
Disallow /gallery/
Disallow /my/
Disallow /gallery/
Disallow /images/
Disallow /cat/rss.aspx
Disallow /cat/details.aspx
Disallow /wm/rss.aspx
Disallow /wm/search.aspx

Comments

  • robots.txt in staging is managed by an IT irule