cutthecrap.me
robots.txt

Robots Exclusion Standard data for cutthecrap.me

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cutthecrap.me
Base Domain	cutthecrap.me
Scan Status	Ok
Last Scan	2025-12-08T14:30:39+00:00
Next Scan	2026-01-07T14:30:39+00:00

Last Scan

Scanned	2025-12-08T14:30:39+00:00
URL	https://cutthecrap.me/robots.txt
Domain IPs	162.159.134.42
Response IP	162.159.134.42
Found	Yes
Hash	3df4de9033fad397e4096ae3bf13f32d895012c87c0a81efc524a75a62a49f49
SimHash	e9408882e4bb

Groups

*

Rule	Path
Disallow	/wp-content/uploads/wc-logs/
Disallow	/wp-content/uploads/woocommerce_transient_files/
Disallow	/wp-content/uploads/woocommerce_uploads/
Disallow	/*?add-to-cart=
Disallow	/?add-to-cart=
Disallow	/wp-admin/
Allow	/wp-admin/admin-ajax.php

Rule

Path

Disallow

/wp-content/uploads/wc-logs/

Disallow

/wp-content/uploads/woocommerce_transient_files/

Disallow

/wp-content/uploads/woocommerce_uploads/

Disallow

/*?add-to-cart=

Disallow

/*?*add-to-cart=

Disallow

/wp-admin/

Allow

/wp-admin/admin-ajax.php

*

Rule	Path
Disallow

Rule

Path

Disallow

Back to top

Other Records

Field	Value
sitemap	https://cutthecrap.me/sitemap_index.xml

Field

Value

sitemap

https://cutthecrap.me/sitemap_index.xml

Back to top

Comments

START YOAST BLOCK
---------------------------
---------------------------
END YOAST BLOCK

Back to top

cutthecrap.merobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

*

Other Records

Comments

cutthecrap.me
robots.txt