whtc.com
robots.txt

Robots Exclusion Standard data for whtc.com

Resource Scan

Scan Details

Site Domain whtc.com
Base Domain whtc.com
Scan Status Ok
Last Scan 2024-11-01T06:17:46+00:00
Next Scan 2024-11-08T06:17:46+00:00

Last Scan

Scanned 2024-11-01T06:17:46+00:00
URL https://whtc.com/robots.txt
Domain IPs 54.84.131.112
Response IP 54.84.131.112
Found Yes
Hash 339451b9754a85b475e8481823bbd41dbc13284398c27be030554b21d1d33124
SimHash e3a0de6043a0
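
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The scan does not name the algorithm, so treating it as SHA-256 is an assumption. Under that assumption, a minimal sketch of how a content hash can be used to detect changes between scans might look like this (the function names `content_hash` and `has_changed` are hypothetical):

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The scan report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against a previously stored hash."""
    return content_hash(body) != previous_hash.lower()
```

A caller would fetch https://whtc.com/robots.txt, pass the raw bytes to `has_changed` together with the stored Hash value, and rescan the rules only when the result is true.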

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

*

Rule Path
Disallow /login/forgotPassword
Disallow /login/forgotPassword/
Disallow /site/adUnit
Disallow /site/adUnit/
Disallow /site/trafficMap
Disallow /site/trafficMap/
Disallow /wpBlogNewsService/logView
Disallow /wpBlogNewsService/logView/
Disallow /search
Disallow /search/
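
Both groups above apply to the same user agent (*), so a crawler following RFC 9309 would treat their rules as one combined set and pick the longest matching path, with Allow winning a tie. The sketch below illustrates that evaluation for the rules listed here. It is a simplified illustration, not a full parser: the names `RULES` and `is_allowed` are hypothetical, and wildcard characters are not handled because none appear in these rules.

```python
from typing import List, Tuple

# Rules copied from the two "*" groups above, merged into one list.
RULES: List[Tuple[str, str]] = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/login/forgotPassword"),
    ("disallow", "/login/forgotPassword/"),
    ("disallow", "/site/adUnit"),
    ("disallow", "/site/adUnit/"),
    ("disallow", "/site/trafficMap"),
    ("disallow", "/site/trafficMap/"),
    ("disallow", "/wpBlogNewsService/logView"),
    ("disallow", "/wpBlogNewsService/logView/"),
    ("disallow", "/search"),
    ("disallow", "/search/"),
]


def is_allowed(path: str, rules: List[Tuple[str, str]] = RULES) -> bool:
    """Longest matching prefix wins; Allow wins ties; no match means allowed."""
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if path.startswith(prefix):
            n = len(prefix)
            if n > best_len or (n == best_len and kind == "allow"):
                best_len = n
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/search?q=x", "/news/"]:
        print(p, is_allowed(p))
```

Note that the rules are plain prefixes, so `Disallow /search` also covers paths such as `/search/` and `/searching`; the trailing-slash variants in the list are redundant under this matching model.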

Other Records

Field Value
crawl-delay 1
sitemap https://whtc.com/sitemap.xml

Comments

  • SoCast
  • socast-elasticsearch-sitemap