uk-air.defra.gov.uk
robots.txt

Robots Exclusion Standard data for uk-air.defra.gov.uk

Resource Scan

Scan Details

Site Domain uk-air.defra.gov.uk
Base Domain defra.gov.uk
Scan Status Ok
Last Scan2024-09-25T19:58:04+00:00
Next Scan 2024-10-25T19:58:04+00:00

Last Scan

Scanned2024-09-25T19:58:04+00:00
URL https://uk-air.defra.gov.uk/robots.txt
Domain IPs 138.199.9.104, 2400:52e0:1a01::899:1
Response IP 143.244.50.82
Found Yes
Hash 378f253b96ec302994638ad183888bfda9658d8daf5b3d3607ec2fae0c00286b
SimHash 3e74eb780f99

Groups

*

Rule Path
Disallow /assets/downloads/
Disallow /datastore/
Disallow /assets/weekly_graphs/
Disallow /assets/graphs/
Disallow /data-providers/
Disallow /forecasting/locations
Disallow /data/data_selector
Disallow /data/exceedence
Disallow /data/data-availability
Disallow /data/DAQI-regional-data
Disallow /data/non-auto-data
Disallow /data/gis-mapping
Disallow /data/openair
Disallow /data/laqm-background-maps
Disallow /data/ozone-data
Disallow /data/uv-data
Disallow /data/uv-index-graphs

Comments

  • All robots will spider the domain
  • Disallow directories
  • Disallow interactive data sections
  • to stop bots hammering the databases