cambridgeinternational.org
robots.txt

Robots Exclusion Standard data for cambridgeinternational.org

Resource Scan

Scan Details

Site Domain cambridgeinternational.org
Base Domain cambridgeinternational.org
Scan Status Ok
Last Scan 2024-10-30T01:35:40+00:00
Next Scan 2024-11-29T01:35:40+00:00

Last Scan

Scanned 2024-10-30T01:35:40+00:00
URL https://cambridgeinternational.org/robots.txt
Redirect https://www.cambridgeinternational.org/robots.txt
Redirect Domain www.cambridgeinternational.org
Redirect Base cambridgeinternational.org
Domain IPs 192.149.119.103
Redirect IPs 192.149.119.103
Response IP 192.149.119.103
Found Yes
Hash 3278a4521061cc3dbf2abdaea9020c58d8c16eda0772b24a67b860051c60bbae
SimHash c904cc0607d0

Groups

*

Rule Path
Disallow /beta.cie.org.uk/
Disallow /prd.cie.org.uk/
Disallow /images/310861-cambridge-appeals-regulations-and-guidance.pdf
Disallow /sitemap/site-map-hidden.aspx
Disallow /Images/cambridge-samples-database.xls
Disallow /images/cambridge-samples-database.xls
Disallow /Images/168168-cambridge-guide-to-making-entries-march-series.pdf
Disallow /uzbekistan-university-admissions-2021/
Disallow /Images/635441-uzbekistan-university-admissions-2021-list.pdf
Disallow /why-choose-us/information-for-schools-in-indonesia/
Disallow /covid/portfolio-of-evidence/results-and-enquiries-about-results/guidance-for-schools/
Disallow /covid/june-2023-exam-series/running-exams/supporting-schools-in-china/
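
To illustrate how a crawler might apply the rules listed above, here is a minimal sketch using Python's standard urllib.robotparser. The ROBOTS_TXT text is a reconstruction from the rule table, not the original file, so its exact layout is an assumption; the names BASE, build_parser and allowed are hypothetical helpers introduced only for this example.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table in this report. The real file's exact
# layout (comments, blank lines, field order) is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /beta.cie.org.uk/
Disallow: /prd.cie.org.uk/
Disallow: /images/310861-cambridge-appeals-regulations-and-guidance.pdf
Disallow: /sitemap/site-map-hidden.aspx
Disallow: /Images/cambridge-samples-database.xls
Disallow: /images/cambridge-samples-database.xls
Disallow: /Images/168168-cambridge-guide-to-making-entries-march-series.pdf
Disallow: /uzbekistan-university-admissions-2021/
Disallow: /Images/635441-uzbekistan-university-admissions-2021-list.pdf
Disallow: /why-choose-us/information-for-schools-in-indonesia/
Disallow: /covid/portfolio-of-evidence/results-and-enquiries-about-results/guidance-for-schools/
Disallow: /covid/june-2023-exam-series/running-exams/supporting-schools-in-china/
"""

# Host taken from the redirect target recorded in the scan details.
BASE = "https://www.cambridgeinternational.org"


def build_parser(text):
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)


def allowed(path, agent="*"):
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for path in (
        "/",
        "/sitemap/site-map-hidden.aspx",
        "/Images/cambridge-samples-database.xls",
        "/IMAGES/cambridge-samples-database.xls",
    ):
        print(path, "allowed" if allowed(path) else "disallowed")
```

Rules are matched as case-sensitive path prefixes, which is a plausible reason the file lists both the /Images/ and /images/ spellings of the same spreadsheet: in this sketch a third spelling such as /IMAGES/ is not covered by either rule.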

Other Records

Field Value
sitemap http://www.cambridgeinternational.org/sitemap_seo.xml
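
The sitemap record can be read with the same standard-library parser. The sketch below assumes Python 3.8 or later (where RobotFileParser.site_maps() is available) and assumes the record appears in the file as a conventional "Sitemap:" line; the variable names are illustrative only.

```python
from urllib.robotparser import RobotFileParser

# Assumed form of the record in the original file, rebuilt from the
# field/value pair shown in this report.
SITEMAP_RECORD = "Sitemap: http://www.cambridgeinternational.org/sitemap_seo.xml"

sitemap_parser = RobotFileParser()
sitemap_parser.parse([SITEMAP_RECORD])

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemap_urls = sitemap_parser.site_maps()

if __name__ == "__main__":
    print(sitemap_urls)
```

Note that the sitemap URL uses the http scheme, while the robots.txt itself was fetched over https.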