septa.org
robots.txt
Robots Exclusion Standard data for septa.org
Resource Scan
Scan Details
Site Domain | septa.org |
Base Domain | septa.org |
Scan Status | Ok |
Last Scan | 2024-11-06T20:52:42+00:00 |
Next Scan | 2024-12-06T20:52:42+00:00 |
Last Scan
Scanned | 2024-11-06T20:52:42+00:00 |
URL | https://www.septa.org/robots.txt |
Domain IPs | 13.35.210.25, 13.35.210.48, 13.35.210.50, 13.35.210.69 |
Response IP | 13.35.210.69 |
Found | Yes |
Hash | d1086fbabf3a0d6bba95ee2b713a7cf121f7a775c2edb7b3f42d98421bfbde81 |
SimHash | 4a8778236f45 |
Groups
*
Rule | Path |
---|---|
Disallow | *.doc |
Disallow | *.docx |
Disallow | *.gif |
Disallow | *.htm |
Disallow | *.html |
Disallow | *.jpg |
Disallow | *.jpeg |
Disallow | |
Disallow | *.php |
Disallow | *.png |
Disallow | *.ppt |
Disallow | *.pptx |
Disallow | *.rtf |
Disallow | *.shtm |
Disallow | *.shtml |
Disallow | *.svg |
Disallow | *.txt |
Disallow | *.xhtml |
Disallow | *.xls |
Disallow | *.xlsx |
Disallow | *.webp |