embassyofcambodiadc.org
robots.txt

Robots Exclusion Standard data for embassyofcambodiadc.org

Resource Scan

Scan Details

Site Domain embassyofcambodiadc.org
Base Domain embassyofcambodiadc.org
Scan Status Ok
Last Scan 2025-11-19T23:01:35+00:00
Next Scan 2025-12-19T23:01:35+00:00
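
The scan interval implied by the two timestamps above can be checked with a little date arithmetic. This is a minimal sketch, assuming the values are ISO 8601 strings exactly as shown; the variable names are illustrative, and the interval is derived from these two values rather than stated by the scanner.

```python
from datetime import datetime

# Timestamps copied from the scan details.
last_scan = datetime.fromisoformat("2025-11-19T23:01:35+00:00")
next_scan = datetime.fromisoformat("2025-12-19T23:01:35+00:00")

# Gap between the scheduled next scan and the last completed one.
interval = next_scan - last_scan
print(interval.days)  # 30
```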

Last Scan

Scanned 2025-11-19T23:01:35+00:00
URL https://embassyofcambodiadc.org/robots.txt
Redirect https://www.embassyofcambodiadc.org/robots.txt
Redirect Domain www.embassyofcambodiadc.org
Redirect Base embassyofcambodiadc.org
Domain IPs 199.34.228.78
Redirect IPs 199.34.228.78
Response IP 199.34.228.78
Found Yes
Hash d3b2d3a846f837717f1d8fe08d75659c6dcbcf548817fa882be88c0e0ffc2cad
SimHash d144d4446ed9
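
The Hash value is 64 hexadecimal digits long, which matches the length of a SHA-256 digest, although the report does not name the algorithm. The sketch below assumes SHA-256 over the raw response bytes; `robots_digest` is a hypothetical helper, and whether the scanner hashes the body this way is an assumption.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Hash value recorded in the scan, used here only for a length comparison.
recorded = "d3b2d3a846f837717f1d8fe08d75659c6dcbcf548817fa882be88c0e0ffc2cad"
same_length = len(robots_digest(b"")) == len(recorded)
print(same_length)
```

Comparing a freshly computed digest against the recorded one would show whether the file changed between scans, provided the algorithm assumption holds.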

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /https%3A//mfaic.gov.kh/home/linktoministeries
Disallow /https%3A//www.mfaic.gov.kh/link-to-organizations
Disallow /cambodia.html
Disallow /https%3A//kh.usembassy.gov/
Disallow /consular.html
Disallow /https%3A//www.tourismcambodia.org/
Disallow /embassy-updates1.html
Disallow /other-links.html

Other Records

Field Value
sitemap https://www.embassyofcambodiadc.org/sitemap.xml
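
The groups listed above can be exercised with Python's standard `urllib.robotparser`. This is a sketch under stated assumptions: `ROBOTS_TXT` is a reconstruction from the parsed groups in this report, not the file as served (its exact bytes, ordering, and comments are unknown), and `examplebot` is a hypothetical crawler name standing in for any agent matched only by the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan's groups; the original file layout is an assumption.
ROBOTS_TXT = """\
User-agent: nerdybot
Disallow: /

User-agent: dotbot
Crawl-delay: 10

User-agent: *
Disallow: /ajax/
Disallow: /apps/
Disallow: /https%3A//mfaic.gov.kh/home/linktoministeries
Disallow: /https%3A//www.mfaic.gov.kh/link-to-organizations
Disallow: /cambodia.html
Disallow: /https%3A//kh.usembassy.gov/
Disallow: /consular.html
Disallow: /https%3A//www.tourismcambodia.org/
Disallow: /embassy-updates1.html
Disallow: /other-links.html

Sitemap: https://www.embassyofcambodiadc.org/sitemap.xml
"""

BASE = "https://www.embassyofcambodiadc.org"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text offline, without fetching anything."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# nerdybot is blocked from the whole site.
print(parser.can_fetch("nerdybot", BASE + "/"))
# dotbot has no path rules, only a crawl delay.
print(parser.can_fetch("dotbot", BASE + "/consular.html"), parser.crawl_delay("dotbot"))
# Any other agent falls through to the * group.
print(parser.can_fetch("examplebot", BASE + "/consular.html"))
print(parser.site_maps())
```

Parsing from a string keeps the check reproducible and avoids a network request; the percent-encoded `/https%3A//...` paths are included for fidelity but are not probed here, since parsers differ in how they normalise such paths.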