aan.sa
robots.txt
Robots Exclusion Standard data for aan.sa
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | aan.sa |
Base Domain | aan.sa |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-10-04T07:11:49+00:00 |
Next Scan | 2025-01-02T07:11:49+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2023-11-17T06:58:57+00:00 |
URL | https://aan.sa/robots.txt |
Domain IPs | 104.21.18.230, 172.67.183.223, 2606:4700:3033::6815:12e6, 2606:4700:3036::ac43:b7df |
Response IP | 172.67.183.223 |
Found | Yes |
Hash | 98d8d53067b7503fe375ff5b4aac73eb1adb6667de30839f4d649e9f74a7513f |
SimHash | 634767710ba7 |
Groups
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
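The crawl-delay record is not part of the core Robots Exclusion Protocol, and crawlers differ in whether they honor it. A minimal sketch of how a crawler might respect it, assuming the value is a number of seconds between consecutive requests; the class name `PoliteScheduler` and its parameters are hypothetical and not taken from the scan tool.

```python
import time

# Value taken from the crawl-delay record above; treating it as seconds is an assumption.
CRAWL_DELAY_SECONDS = 10


class PoliteScheduler:
    """Spaces out requests so at least `delay` seconds pass between them."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self.clock = clock      # injectable so the class can be tested without waiting
        self.sleep = sleep
        self.last_request = None

    def wait_time(self):
        """Seconds still to wait before the next request may be sent."""
        if self.last_request is None:
            return 0.0
        elapsed = self.clock() - self.last_request
        return max(0.0, self.delay - elapsed)

    def wait(self):
        """Block until the delay has passed, then record the request time."""
        pause = self.wait_time()
        if pause > 0:
            self.sleep(pause)
        self.last_request = self.clock()
```

A crawler would call `PoliteScheduler(CRAWL_DELAY_SECONDS).wait()` before each fetch; the first call returns immediately and later calls pause only for the remaining part of the delay.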
*
Rule | Path |
---|---|
Allow | / |
Disallow | /*%26lt%3Biframe |
Disallow | /*?currency= |
Disallow | /*/gateway/checkout* |
Disallow | /payment/* |
Disallow | /thankyou/* |
Disallow | /*/p*?page=* |
Disallow | /*/page-*?page=* |
Disallow | /cart |
Disallow | */redirect |
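Several of the paths above contain `*` wildcards, so a plain prefix comparison is not enough to evaluate them. A minimal sketch of one way to test a URL path against this group, assuming the precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie) and comparing raw strings without percent-encoding normalization. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative only.

```python
import re

# Rules copied from the table above, in the order listed.
RULES = [
    ("allow", "/"),
    ("disallow", "/*%26lt%3Biframe"),
    ("disallow", "/*?currency="),
    ("disallow", "/*/gateway/checkout*"),
    ("disallow", "/payment/*"),
    ("disallow", "/thankyou/*"),
    ("disallow", "/*/p*?page=*"),
    ("disallow", "/*/page-*?page=*"),
    ("disallow", "/cart"),
    ("disallow", "*/redirect"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt pattern: '*' matches any run, a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' where the wildcards were.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule is an Allow, or if no rule matches."""
    best_len, allowed = -1, True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, giving prefix semantics.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/products/shoes")` returns `True`, while `is_allowed("/cart")`, `is_allowed("/payment/visa")`, and `is_allowed("/en/products?page=2")` each return `False`.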
Warnings
- 10 invalid lines.