theboxinthebox.com
robots.txt

Robots Exclusion Standard data for theboxinthebox.com

Resource Scan

Scan Details

Site Domain theboxinthebox.com
Base Domain theboxinthebox.com
Scan Status Ok
Last Scan 2024-11-02T10:32:03+00:00
Next Scan 2024-12-02T10:32:03+00:00

Last Scan

Scanned 2024-11-02T10:32:03+00:00
URL https://theboxinthebox.com/robots.txt
Domain IPs 192.0.78.190, 192.0.78.221
Response IP 192.0.78.221
Found Yes
Hash 52b9fd1697bc600d042c72a7040bf6fc2db00ff0774145900f9e795eb4558980
SimHash eb0188226f93

Groups

*

Rule Path
Disallow /wp-content/uploads/wc-logs/
Disallow /wp-content/uploads/woocommerce_transient_files/
Disallow /wp-content/uploads/woocommerce_uploads/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
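
How the Allow line interacts with the broader /wp-admin/ Disallow depends on the crawler. The sketch below assumes longest-match precedence as described in RFC 9309, with Allow winning a tie. The name is_allowed and the RULES list are illustrative, not part of the scan output. None of the listed paths use wildcards, so plain prefix matching is enough here.

```python
# Rules listed in the scan for the "*" group: (directive, path prefix).
RULES = [
    ("disallow", "/wp-content/uploads/wc-logs/"),
    ("disallow", "/wp-content/uploads/woocommerce_transient_files/"),
    ("disallow", "/wp-content/uploads/woocommerce_uploads/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled.

    Assumption: the longest matching prefix decides, and an Allow
    rule wins over a Disallow rule of the same length.
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_allow = len(prefix) == best_len and directive == "allow"
        if longer or tie_allow:
            best_len = len(prefix)
            allowed = directive == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/wp-admin/", "/wp-admin/admin-ajax.php", "/shop/"):
        print(p, is_allowed(p))
```

Parsers that apply rules in file order rather than by longest match can reach a different verdict for /wp-admin/admin-ajax.php, because the Disallow for /wp-admin/ appears first.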

Other Records

Field Value
sitemap https://theboxinthebox.com/sitemap.xml
sitemap https://theboxinthebox.com/news-sitemap.xml
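
The sitemap records sit outside any user-agent group, so a crawler can collect them regardless of which group applies to it. The sketch below shows one way to pull them out of a robots.txt body. The ROBOTS_TXT string is a reconstruction from the fields listed above, not the exact bytes that were scanned, and sitemap_urls is a hypothetical helper name.

```python
# Assumed reconstruction of the file from the scan's groups and records;
# the real file may differ in ordering, spacing, and comments.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-content/uploads/wc-logs/
Disallow: /wp-content/uploads/woocommerce_transient_files/
Disallow: /wp-content/uploads/woocommerce_uploads/
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

Sitemap: https://theboxinthebox.com/sitemap.xml
Sitemap: https://theboxinthebox.com/news-sitemap.xml
"""


def sitemap_urls(text):
    """Return the values of all sitemap records, in file order."""
    urls = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        # Split at the first colon only, so the URL's own colons survive.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


if __name__ == "__main__":
    for url in sitemap_urls(ROBOTS_TXT):
        print(url)
```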