Robots Exclusion Standard data for alwayth.com
Resource Scan
Scan Details
Site Domain | alwayth.com |
Base Domain | alwayth.com |
Scan Status | Ok |
Last Scan | 2024-11-02T07:36:20+00:00 |
Next Scan | 2024-11-16T07:36:20+00:00 |
Last Scan
Scanned | 2024-11-02T07:36:20+00:00 |
URL | https://www.alwayth.com/robots.txt |
Domain IPs | 13.230.149.252, 3.113.186.52, 54.249.246.233 |
Response IP | 3.113.186.52 |
Found | Yes |
Hash | 969abf491dab18882eeeb92920c1e6dfc6eb07050c38882c9779c956119af207 |
SimHash | 421cc890e7d3 |
Groups
thesis-research-bot
fidget-spinner-bot
my-tiny-bot
semrushbot
ahrefsbot
dotbot
mj12bot
amazonbot
go-http-client
geedoproductsearch
Rule | Path |
---|---|
Disallow | / |
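The ten user agents listed above share this single rule, so a crawler whose product token matches any of them is asked to stay out of the whole site. A minimal Python sketch of that group lookup, assuming case-insensitive token matching as described in RFC 9309; the names `BLOCKED_AGENTS` and `is_fully_blocked` are hypothetical and do not come from the scan report:

```python
# User agents from the first group in the scan; all share "Disallow: /".
BLOCKED_AGENTS = frozenset({
    "thesis-research-bot", "fidget-spinner-bot", "my-tiny-bot", "semrushbot",
    "ahrefsbot", "dotbot", "mj12bot", "amazonbot", "go-http-client",
    "geedoproductsearch",
})


def is_fully_blocked(product_token: str) -> bool:
    """Return True if the product token belongs to the Disallow: / group."""
    # Assumption: tokens are compared case-insensitively, ignoring outer whitespace.
    return product_token.strip().lower() in BLOCKED_AGENTS
```

For example, `is_fully_blocked("SemrushBot")` is true under this assumption, while `is_fully_blocked("bingbot")` is false because bingbot has its own group.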
bingbot
Rule | Path |
---|---|
Allow | / |
Disallow | /cart/ |
Disallow | /web_cart/ |
Disallow | /shops/ |
Disallow | /en/shops/ |
Disallow | /api/shops/ |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
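The bingbot group combines a broad `Allow: /` with narrower `Disallow` paths, so the outcome for a URL depends on which rule takes precedence. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching path wins, and `Allow` wins a tie) and plain prefix matching without wildcards. `BINGBOT_RULES`, `CRAWL_DELAY_SECONDS`, and `is_allowed` are hypothetical names used only for illustration:

```python
# Rules reported for the bingbot group, as (rule, path) pairs.
BINGBOT_RULES = [
    ("allow", "/"),
    ("disallow", "/cart/"),
    ("disallow", "/web_cart/"),
    ("disallow", "/shops/"),
    ("disallow", "/en/shops/"),
    ("disallow", "/api/shops/"),
]

# From the "crawl-delay" record; this field is not part of RFC 9309,
# and whether and how a crawler honors it is up to that crawler.
CRAWL_DELAY_SECONDS = 10


def is_allowed(path: str, rules=BINGBOT_RULES) -> bool:
    """Longest matching prefix wins; Allow wins a tie; no match means allowed."""
    best_len, allowed = -1, True
    for rule, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and rule == "allow"
        if longer or tie_for_allow:
            best_len, allowed = len(prefix), rule == "allow"
    return allowed
```

Under these assumptions `is_allowed("/cart/checkout")` is false and `is_allowed("/products/1")` is true. A parser that applies rules in file order and stops at the first match, as Python's `urllib.robotparser` does, could reach a different result if `Allow: /` precedes the `Disallow` lines in the file, which is one reason the precedence is spelled out by hand here.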
*
Rule | Path |
---|---|
Allow | / |
Disallow | /cart/ |
Disallow | /web_cart/ |
Disallow | /shops/ |
Disallow | /en/shops/ |
Disallow | /api/shops/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.alwayth.com/sitemap.xml |
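The report lists the sitemap record under the `*` group, although the sitemaps.org protocol treats a `Sitemap` line as independent of any user-agent group. A minimal sketch of extracting such records from robots.txt text follows. The `ROBOTS_FRAGMENT` string is a reconstruction from the table above, not the file as served (its line order and spacing are assumptions), and `sitemap_urls` is a hypothetical helper:

```python
# Reconstructed from the "*" group in the scan report; the real file's
# ordering, casing, and whitespace are not known and are assumed here.
ROBOTS_FRAGMENT = """\
User-agent: *
Allow: /
Disallow: /cart/
Disallow: /web_cart/
Disallow: /shops/
Disallow: /en/shops/
Disallow: /api/shops/
Sitemap: https://www.alwayth.com/sitemap.xml
"""


def sitemap_urls(robots_text: str) -> list[str]:
    """Collect the values of all Sitemap records, matching the field name case-insensitively."""
    urls = []
    for line in robots_text.splitlines():
        # Split on the first colon only, so the URL's own colons stay intact.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls
```

Calling `sitemap_urls(ROBOTS_FRAGMENT)` on this reconstructed fragment yields a one-element list containing `https://www.alwayth.com/sitemap.xml`.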