whbl.com
robots.txt

Robots Exclusion Standard data for whbl.com

Resource Scan

Scan Details

Site Domain whbl.com
Base Domain whbl.com
Scan Status Ok
Last Scan 2024-09-22T16:22:32+00:00
Next Scan 2024-09-29T16:22:32+00:00

Last Scan

Scanned 2024-09-22T16:22:32+00:00
URL https://whbl.com/robots.txt
Domain IPs 54.84.131.112
Response IP 54.84.131.112
Found Yes
Hash 99849fc6efb248d4327b6ca2204993f71fc8143d86df830610e07f42b7576ef3
SimHash e3a0d6404ba0

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

*

Rule Path
Disallow /login/forgotPassword
Disallow /login/forgotPassword/
Disallow /site/adUnit
Disallow /site/adUnit/
Disallow /site/trafficMap
Disallow /site/trafficMap/
Disallow /wpBlogNewsService/logView
Disallow /wpBlogNewsService/logView/
Disallow /search
Disallow /search/
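
The two `*` groups above can be read as one combined rule set, since RFC 9309 says groups naming the same user agent are merged. The sketch below is illustrative only and is not part of the scan data: it assumes a crawler that applies the RFC's longest-match rule, with Allow winning ties. The names `RULES` and `is_allowed` are hypothetical. Plain prefix matching is enough here because none of the listed paths use `*` or `$`.

```python
# Rules copied from the two "*" groups in the scan above.
# Assumption: both groups are combined, as RFC 9309 describes for
# groups that name the same user agent.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/login/forgotPassword"),
    ("disallow", "/login/forgotPassword/"),
    ("disallow", "/site/adUnit"),
    ("disallow", "/site/adUnit/"),
    ("disallow", "/site/trafficMap"),
    ("disallow", "/site/trafficMap/"),
    ("disallow", "/wpBlogNewsService/logView"),
    ("disallow", "/wpBlogNewsService/logView/"),
    ("disallow", "/search"),
    ("disallow", "/search/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match semantics.

    The longest matching rule path wins; on a tie, "allow" wins.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/wp-admin/", "/wp-admin/admin-ajax.php", "/search?q=news", "/news/"]:
        print(p, is_allowed(p))
```

Parsers that apply rules in file order (first match wins) rather than longest match would treat `/wp-admin/admin-ajax.php` as disallowed, so results can differ between crawlers.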

Other Records

Field Value
crawl-delay 1
sitemap https://whbl.com/sitemap.xml

Comments

  • SoCast
  • socast-elasticsearch-sitemap