planetautor5.wpcomstaging.com
robots.txt

Robots Exclusion Standard data for planetautor5.wpcomstaging.com

Resource Scan

Scan Details

Site Domain planetautor5.wpcomstaging.com
Base Domain wpcomstaging.com
Scan Status Ok
Last Scan2024-09-25T12:43:20+00:00
Next Scan 2024-10-25T12:43:20+00:00

Last Scan

Scanned2024-09-25T12:43:20+00:00
URL https://planetautor5.wpcomstaging.com/robots.txt
Domain IPs 192.0.78.20
Response IP 192.0.78.20
Found Yes
Hash d03b9aedcbcc1837b8a485bc8333413b32a0ccd55ffebbfb9152ae3d7d837960
SimHash 4805c8c2a402

Groups

termlybot

Rule Path
Allow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

*

Rule Path
Disallow /?s=
Disallow /page/*/?s=
Disallow /search/
Disallow /wp-json/
Disallow /?rest_route=

adsbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://planetautor5.blog/sitemap.xml
sitemap https://planetautor5.blog/news-sitemap.xml
sitemap https://planetautor5.blog/sitemap_index.xml

Comments

  • Termly scanner
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK