gcaptain.com
robots.txt

Robots Exclusion Standard data for gcaptain.com

Resource Scan

Scan Details

Site Domain gcaptain.com
Base Domain gcaptain.com
Scan Status Ok
Last Scan 2024-11-03T20:31:36+00:00
Next Scan 2024-11-10T20:31:36+00:00

Last Scan

Scanned 2024-11-03T20:31:36+00:00
URL https://gcaptain.com/robots.txt
Domain IPs 104.26.8.122, 104.26.9.122, 172.67.69.10, 2606:4700:20::681a:87a, 2606:4700:20::681a:97a, 2606:4700:20::ac43:450a
Response IP 172.67.69.10
Found Yes
Hash 5ad2fd6e71a6d394a3e51ad13046a2198847821ff79cff56a768e4a0dfd0da5f
SimHash b8284c808e92

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /?*
Disallow /*/?*
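
To illustrate how the three Disallow rules above might be applied, here is a minimal Python sketch. It assumes the common wildcard semantics in which `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical and do not come from the scan. Because the group contains no Allow rules, the sketch skips longest-match precedence and treats any matching Disallow as a block.

```python
import re

# Disallow patterns copied from the "*" group in the scan above.
DISALLOW = ["/wp-admin/", "/?*", "/*/?*"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    Assumed semantics: '*' matches any run of characters, a trailing '$'
    anchors the end of the path, and everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path: str) -> bool:
    """Return True if the path (including any query string) matches a rule."""
    return any(rule.match(path) for rule in RULES)


if __name__ == "__main__":
    # Sample paths are illustrative only, not taken from the site.
    for sample in ["/wp-admin/options.php", "/?s=ships",
                   "/news/?page=2", "/news/article/", "/"]:
        print(sample, is_disallowed(sample))
```

Under these assumptions, the two wildcard rules block any URL whose query string begins directly after a slash, at the site root or in any subdirectory, while plain paths outside /wp-admin/ stay crawlable.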

Other Records

Field Value
sitemap http://gcaptain.com/sitemap_index.xml

Warnings

  • 1 invalid line.