arlingtontransit.com
robots.txt

Robots Exclusion Standard data for arlingtontransit.com

Resource Scan

Scan Details

Site Domain arlingtontransit.com
Base Domain arlingtontransit.com
Scan Status Ok
Last Scan2025-09-20T11:59:35+00:00
Next Scan 2025-10-20T11:59:35+00:00

Last Scan

Scanned2025-09-20T11:59:35+00:00
URL https://arlingtontransit.com/robots.txt
Domain IPs 104.153.195.182
Response IP 104.153.195.182
Found Yes
Hash 4b23cfba81d8921b6161df62aad276b31af81eefe924f0daf305b54d17fde80a
SimHash cb3c5d62ee68

Groups

*

Rule Path
Disallow /admin/
Disallow /tasks/
Disallow /core/
Disallow /config/
Disallow /sites/cfd/includes/themes/cfd/includes/display_objects/custom/transitinfostructure/
Disallow /tools-resources/nova-transit-schedules/schedule-change-notification/
Disallow /tools-resources/nova-point-to-point-schedules/
Disallow /routes-schedules/schedules/point-to-point-schedules/
Disallow /routes-schedules/schedules/schedule-change-notification/

Other Records

Field Value
crawl-delay 5

megaindex.ru

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

cludo.com

Rule Path
Disallow /

python-requests

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /routes-schedules/schedules/point-to-point-schedules/
Disallow /tools-resources/nova-point-to-point-schedules/

dotbot

Rule Path
Disallow /