simon.com
robots.txt
Robots Exclusion Standard data for simon.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | simon.com |
Base Domain | simon.com |
Scan Status | Ok |
Last Scan | 2024-11-12T18:06:38+00:00 |
Next Scan | 2024-11-19T18:06:38+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-12T18:06:38+00:00 |
URL | https://simon.com/robots.txt |
Redirect | https://www.simon.com/robots.txt |
Redirect Domain | www.simon.com |
Redirect Base | simon.com |
Domain IPs | 20.69.216.13 |
Redirect IPs | 116.51.25.99, 116.51.25.100, 116.51.25.101, 116.51.25.102 |
Response IP | 116.51.25.102 |
Found | Yes |
Hash | 3d01eb4141b8f527d34dade5f59525075fdd385349077db8c84e889121863990 |
SimHash | f419111387b5 |
Groups
amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
imagesiftbot
img2dataset
omgili
omgilibot
perplexitybot
youbot
Rule | Path |
---|---|
Disallow | / |
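The group above applies one rule, `Disallow: /`, to every listed user agent, while all other crawlers fall through to the `*` group. A minimal Python sketch of that group selection follows. It assumes exact, case-insensitive matching on the crawler's product token, which is a simplification of what real crawlers do; the function name `select_group` and the label `"named"` are hypothetical and not part of the scan data.

```python
# User agents listed in the first group of the scanned robots.txt.
NAMED_GROUP = frozenset({
    "amazonbot", "anthropic-ai", "applebot-extended", "bytespider",
    "ccbot", "chatgpt-user", "claudebot", "claude-web", "cohere-ai",
    "diffbot", "facebookbot", "friendlycrawler", "google-extended",
    "googleother", "googleother-image", "googleother-video", "gptbot",
    "imagesiftbot", "img2dataset", "omgili", "omgilibot",
    "perplexitybot", "youbot",
})


def select_group(product_token: str) -> str:
    """Return "named" if the token is in the listed group, else "*".

    Assumption: exact match after lowercasing; real crawlers may
    match product tokens more loosely.
    """
    token = product_token.strip().lower()
    return "named" if token in NAMED_GROUP else "*"


def is_fully_blocked(product_token: str) -> bool:
    """The named group carries a single rule, Disallow: /."""
    return select_group(product_token) == "named"
```

Under these assumptions, `is_fully_blocked("GPTBot")` is true, while a token that is not listed, such as `Googlebot`, is governed by the `*` group instead.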
*
Rule | Path |
---|---|
Disallow | /about_simon/RadControls/ |
Disallow | /brand/ |
Disallow | /contentstream/ |
Disallow | /contentstreamcsi/ |
Disallow | /email/rscdealemailcreate/ |
Disallow | /errors/ |
Disallow | /retailpromotions/passwordrequirements.html |
Disallow | /retailshowcase/reporting/PrintAllRSCOffers.aspx |
Disallow | /system/ |
Disallow | /volume/blank.aspx |
Disallow | /volume/help.aspx |
Disallow | /wifi/ |
Disallow | /mall/*/directions/ |
Disallow | /mall/*/directions |
Disallow | /sms-opt-in/ |
Disallow | /email/* |
Disallow | /mall/*/stores/print/* |
Disallow | /bot-challenge |
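Several of the `*` group paths above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a rough sketch of how these rules could be checked by hand, assuming prefix matching from the start of the path, `*` matching any run of characters, and case-sensitive comparison. The helper names and the sample paths in the usage note are hypothetical. Because this group contains only `Disallow` rules, any match means the path is disallowed.

```python
import re

# Disallow paths for the "*" group, copied from the table above.
DISALLOW_PATTERNS = [
    "/about_simon/RadControls/",
    "/brand/",
    "/contentstream/",
    "/contentstreamcsi/",
    "/email/rscdealemailcreate/",
    "/errors/",
    "/retailpromotions/passwordrequirements.html",
    "/retailshowcase/reporting/PrintAllRSCOffers.aspx",
    "/system/",
    "/volume/blank.aspx",
    "/volume/help.aspx",
    "/wifi/",
    "/mall/*/directions/",
    "/mall/*/directions",
    "/sms-opt-in/",
    "/email/*",
    "/mall/*/stores/print/*",
    "/bot-challenge",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    "*" becomes ".*"; a trailing "$" anchors the end of the path.
    Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW_PATTERNS)
```

With this sketch, a hypothetical path such as `/mall/some-mall/directions` is disallowed, while `/mall/some-mall/stores` is not matched by any pattern and is therefore allowed.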
Other Records
Field | Value |
---|---|
sitemap | https://www.simon.com/sitemap.xml |