cdac.org.sg
robots.txt
Robots Exclusion Standard data for cdac.org.sg
Resource Scan
Scan Details
| Site Domain | cdac.org.sg |
| Base Domain | cdac.org.sg |
| Scan Status | Ok |
| Last Scan | 2026-02-18T13:39:20+00:00 |
| Next Scan | 2026-03-20T13:39:20+00:00 |
Last Scan
| Scanned | 2026-02-18T13:39:20+00:00 |
| URL | https://cdac.org.sg/robots.txt |
| Domain IPs | 104.20.34.43, 172.66.175.182, 2606:4700:10::6814:222b, 2606:4700:10::ac42:afb6 |
| Response IP | 104.20.34.43 |
| Found | Yes |
| Hash | 3c44e348f19e047c62cc9841547e7d7b3da38c027f4f058e7700853d418c5fd8 |
| SimHash | 34200d43d0d4 |
Groups
ai2bot
ai2bot-dolma
aihitbot
amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
claudebot
cohere-ai
cohere-training-data-crawler
duckassistbot
facebookbot
google-extended
googleother
googleother-image
googleother-video
gptbot
img2dataset
meta-externalagent
mycentralaiscraperbot
omgili
omgilibot
quora-bot
tiktokspider
youbot
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
*
| Rule | Path |
|---|---|
| Disallow | /config |
| Disallow | /search |
| Disallow | /account$ |
| Disallow | /account/ |
| Disallow | /commerce/digital-download/ |
| Disallow | /api/ |
| Allow | /api/ui-extensions/ |
| Disallow | /static/ |
| Disallow | /*?author=* |
| Disallow | /*%26author%3D* |
| Disallow | /*?tag=* |
| Disallow | /*%26tag%3D* |
| Disallow | /*?month=* |
| Disallow | /*%26month%3D* |
| Disallow | /*?view=* |
| Disallow | /*%26view%3D* |
| Disallow | /*?format=json |
| Disallow | /*%26format%3Djson |
| Disallow | /*?format=page-context |
| Disallow | /*%26format%3Dpage-context |
| Disallow | /*?format=main-content |
| Disallow | /*%26format%3Dmain-content |
| Disallow | /*?format=json-pretty |
| Disallow | /*%26format%3Djson-pretty |
| Disallow | /*?format=ical |
| Disallow | /*%26format%3Dical |
| Disallow | /*?reversePaginate=* |
| Disallow | /*%26reversePaginate%3D* |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.cdac.org.sg/sitemap.xml |
Comments