wghn.com
robots.txt

Robots Exclusion Standard data for wghn.com

Resource Scan

Scan Details

Site Domain wghn.com
Base Domain wghn.com
Scan Status Ok
Last Scan 2024-11-14T04:34:21+00:00
Next Scan 2024-11-21T04:34:21+00:00

Last Scan

Scanned 2024-11-14T04:34:21+00:00
URL https://wghn.com/robots.txt
Domain IPs 54.84.131.112
Response IP 54.84.131.112
Found Yes
Hash cbe118c6a66dfc93326992eb119bf10ddc4c0737bbeefe91a05b0e8677f012d6
SimHash f380d6604b22

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

*

Rule Path
Disallow /login/forgotPassword
Disallow /login/forgotPassword/
Disallow /site/adUnit
Disallow /site/adUnit/
Disallow /site/trafficMap
Disallow /site/trafficMap/
Disallow /wpBlogNewsService/logView
Disallow /wpBlogNewsService/logView/
Disallow /search
Disallow /search/
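
The two `*` groups above can be checked against sample paths with a short sketch. It assumes the groups are merged into one rule set and that the longest matching path wins, with Allow winning ties, as described in RFC 9309. The `RULES` list and the `is_allowed` helper are hypothetical names introduced here for illustration; they are not part of the scan output, and a given crawler may resolve these rules differently.

```python
# Rules copied from the two "*" groups in the scan, merged into one list.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/login/forgotPassword"),
    ("disallow", "/login/forgotPassword/"),
    ("disallow", "/site/adUnit"),
    ("disallow", "/site/adUnit/"),
    ("disallow", "/site/trafficMap"),
    ("disallow", "/site/trafficMap/"),
    ("disallow", "/wpBlogNewsService/logView"),
    ("disallow", "/wpBlogNewsService/logView/"),
    ("disallow", "/search"),
    ("disallow", "/search/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: plain prefix matching is enough because none of the
    scanned rules use the `*` or `$` wildcards.
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed by default
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for sample in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php",
                   "/search?q=news", "/news/"):
        print(sample, is_allowed(sample))
```

Under these assumptions the Allow rule for `/wp-admin/admin-ajax.php` overrides the shorter `/wp-admin/` Disallow. Because matching is by prefix, `/search` also covers `/search/`, so the trailing-slash variants in the second group are redundant.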

Other Records

Field Value
crawl-delay 1
sitemap https://wghn.com/sitemap.xml
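
The crawl-delay and sitemap records can be read programmatically with Python's standard `urllib.robotparser`. The sketch below parses a hand-written excerpt rather than fetching the live file. The excerpt's layout is an assumption: the scan does not show which group the `crawl-delay` line belongs to, so it is placed in a `*` group here for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt rebuilt from the scan; the real file's layout
# (in particular where Crawl-delay sits) is not shown in the report.
ROBOTS_EXCERPT = """\
User-agent: *
Crawl-delay: 1
Disallow: /search

Sitemap: https://wghn.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

delay = parser.crawl_delay("*")   # seconds between requests, or None
sitemaps = parser.site_maps()     # list of sitemap URLs, or None (Python 3.8+)

if __name__ == "__main__":
    print("crawl delay:", delay)
    print("sitemaps:", sitemaps)
```

A polite crawler would sleep for `delay` seconds between requests and use the sitemap URL as its starting point for discovery.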

Comments

  • SoCast
  • socast-elasticsearch-sitemap