commandocomics.com
robots.txt

Robots Exclusion Standard data for commandocomics.com

Resource Scan

Scan Details

Site Domain commandocomics.com
Base Domain commandocomics.com
Scan Status Ok
Last Scan2024-06-12T19:58:10+00:00
Next Scan 2024-06-19T19:58:10+00:00

Last Scan

Scanned2024-06-12T19:58:10+00:00
URL https://commandocomics.com/robots.txt
Redirect https://www.commandocomics.com/robots.txt
Redirect Domain www.commandocomics.com
Redirect Base commandocomics.com
Domain IPs 2a12:5240::1, 89.106.200.1
Redirect IPs 104.18.28.20, 104.18.29.20, 2606:4700::6812:1c14, 2606:4700::6812:1d14
Response IP 104.18.29.20
Found Yes
Hash 0050293bb87161bf498c257bdeda3e1592e6160f70c0d99c88b28849af22ce26
SimHash 18675a40b193

Groups

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin*
Disallow *s%3Dfeed
Disallow */?s&amp%3B*
Disallow */?s=*
Disallow *s%3D*
Disallow /search/*
Disallow /search?q=*
Disallow /?filter*
Disallow *?share=*

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.commandocomics.com/sitemap_index.xml

Comments

  • network
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK