gcarian.com
robots.txt

Robots Exclusion Standard data for gcarian.com

Resource Scan

Scan Details

Site Domain gcarian.com
Base Domain gcarian.com
Scan Status Ok
Last Scan2025-06-23T06:13:12+00:00
Next Scan 2025-06-30T06:13:12+00:00

Last Scan

Scanned2025-06-23T06:13:12+00:00
URL https://gcarian.com/robots.txt
Redirect https://www.gcarian.com/robots.txt
Redirect Domain www.gcarian.com
Redirect Base gcarian.com
Domain IPs 104.21.46.98, 172.67.137.89, 2606:4700:3033::ac43:8959, 2606:4700:3035::6815:2e62
Redirect IPs 104.21.46.98, 172.67.137.89, 2606:4700:3033::ac43:8959, 2606:4700:3035::6815:2e62
Response IP 172.67.137.89
Found Yes
Hash 53a18cae15615fe51b22090f0428de8f145b9e4420989aa317a9e15e91a89d6f
SimHash 6b4459c62231

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /*/*.css
Allow /*/*.js
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /readme.html
Disallow /license.txt
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-register.php
Disallow *?attachment_id=
Disallow /*~*
Disallow /*~

googlebot

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

googlebot-video

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

baiduspider

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /wp-content/uploads/

Other Records

Field Value
sitemap https://www.gcarian.com/sitemap_index.xml