fireworksinillinois.com
robots.txt

Robots Exclusion Standard data for fireworksinillinois.com

Resource Scan

Scan Details

Site Domain fireworksinillinois.com
Base Domain fireworksinillinois.com
Scan Status Ok
Last Scan2024-09-25T04:00:44+00:00
Next Scan 2024-10-02T04:00:44+00:00

Last Scan

Scanned2024-09-25T04:00:44+00:00
URL https://fireworksinillinois.com/robots.txt
Domain IPs 104.21.2.212, 172.67.129.180, 2606:4700:3037::6815:2d4, 2606:4700:3037::ac43:81b4
Response IP 104.21.2.212
Found Yes
Hash dbd76af2ff00c4e7981585c8c0710897829d47535135c4b49578d202e638b47a
SimHash a921d94a4715

Groups

*

Rule Path
Disallow /backup$

*

Rule Path
Disallow /feed/

*

Rule Path
Disallow /archives/

*

Rule Path
Disallow /index.php

*

Rule Path
Disallow /*?

*

Rule Path
Disallow /*.php$

*

Rule Path
Disallow /*.inc$

*

Rule Path
Disallow */feed

*

Rule Path
Disallow */page

*

Rule Path
Disallow */trackback/

*

Rule Path
Disallow /tag/

*

Rule Path
Disallow /category/

*

Rule Path
Disallow /includes/

*

Rule Path
Allow /page-sitemap.xml

mj12bot

Rule Path
Disallow

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://fireworksinillinois.com/sitemap_index.xml

Comments

  • Edited by Meg 4/7/2016
  • Yea, I saw.