catlangu.com
robots.txt

Robots Exclusion Standard data for catlangu.com

Resource Scan

Scan Details

Site Domain catlangu.com
Base Domain catlangu.com
Scan Status Ok
Last Scan2026-02-25T09:43:13+00:00
Next Scan 2026-02-26T09:43:13+00:00

Last Scan

Scanned2026-02-25T09:43:13+00:00
URL https://catlangu.com/robots.txt
Domain IPs 104.21.84.165, 172.67.195.61, 2606:4700:3030::6815:54a5, 2606:4700:3036::ac43:c33d
Response IP 172.67.195.61
Found Yes
Hash 7960381dcaa1fd9430146332a81ad0f285c5632fd9d9528e05bbe5c21480444f
SimHash 6804998145b2

Groups

google-adstxt

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

googlebot

Rule Path
Disallow

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://catlangu.com/sitemap_index.xml

Comments

  • Allow Google ads crawlers to read app-ads.txt
  • Default rules for all other crawlers