bruneitribune.com
robots.txt

Robots Exclusion Standard data for bruneitribune.com

Resource Scan

Scan Details

Site Domain bruneitribune.com
Base Domain bruneitribune.com
Scan Status Ok
Last Scan5/29/2025, 6:04:19 AM
Next Scan 6/28/2025, 6:04:19 AM

Last Scan

Scanned5/29/2025, 6:04:19 AM
URL https://bruneitribune.com/robots.txt
Domain IPs 104.21.18.103, 172.67.181.148, 2606:4700:3030::ac43:b594, 2606:4700:3031::6815:1267
Response IP 172.67.181.148
Found Yes
Hash 59df4527da0e79bc791f3533a8208270bd349f647eb86bf97f233f48f587479a
SimHash 62344dd2ce89

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-content/
Disallow /wp-includes/
Disallow /go/
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php
Disallow /search?
Disallow /?p=*

mediapartners-google*

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5200

marketwirebot

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gnowitnewsbot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://cambodiatribune.com/sitemap.xml.gz

Warnings

  • 1 invalid line.