generalcrumb.com
robots.txt

Robots Exclusion Standard data for generalcrumb.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	generalcrumb.com
Base Domain	generalcrumb.com
Scan Status	Ok
Last Scan	2025-05-22T03:16:51+00:00
Next Scan	2025-05-29T03:16:51+00:00

Last Scan

Scanned	2025-05-22T03:16:51+00:00
URL	https://generalcrumb.com/robots.txt
Domain IPs	192.0.78.175, 192.0.78.249
Response IP	192.0.78.249
Found	Yes
Hash	c887176331f753c84a5b04d27610fea56760a323b690cafa28f33c669bf13e9d
SimHash	eb28a880ec9b

Groups

*

Rule	Path
Disallow	/wp-content/uploads/wc-logs/
Disallow	/wp-content/uploads/woocommerce_transient_files/
Disallow	/wp-content/uploads/woocommerce_uploads/
Disallow	/wp-admin/
Allow	/wp-admin/admin-ajax.php

Rule

Path

Disallow

/wp-content/uploads/wc-logs/

Disallow

/wp-content/uploads/woocommerce_transient_files/

Disallow

/wp-content/uploads/woocommerce_uploads/

Disallow

/wp-admin/

Allow

/wp-admin/admin-ajax.php

*

Rule	Path
Disallow

Rule

Path

Disallow

Back to top

Other Records

Field	Value
sitemap	https://generalcrumb.com/sitemap.xml
sitemap	https://generalcrumb.com/news-sitemap.xml
sitemap	https://generalcrumb.com/sitemap_index.xml

Field

Value

sitemap

https://generalcrumb.com/sitemap.xml

sitemap

https://generalcrumb.com/news-sitemap.xml

sitemap

https://generalcrumb.com/sitemap_index.xml

Back to top

Comments

START YOAST BLOCK
---------------------------
---------------------------
END YOAST BLOCK

Back to top

generalcrumb.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

*

Other Records

Comments

generalcrumb.com
robots.txt