themanual.com
robots.txt

Robots Exclusion Standard data for themanual.com

Resource Scan

Scan Details

Site Domain themanual.com
Base Domain themanual.com
Scan Status Ok
Last Scan 2024-06-07T19:58:15+00:00
Next Scan 2024-06-14T19:58:15+00:00

Last Scan

Scanned 2024-06-07T19:58:15+00:00
URL https://themanual.com/robots.txt
Redirect https://www.themanual.com/robots.txt
Redirect Domain www.themanual.com
Redirect Base themanual.com
Domain IPs 192.0.66.184
Redirect IPs 192.0.66.184
Response IP 192.0.66.184
Found Yes
Hash 7a61268cc778ba296eb2a79b99a8a2f5f9f94211211365901188b062edb8ff5f
SimHash 4d7ddce48ad2

Groups

magpie-crawler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /uncategorized/*
Disallow /aop/*
Disallow /tag/*
Disallow /?s=
Disallow /search
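
The groups above can be checked programmatically. The following is a minimal sketch, assuming the rules are re-serialized into ordinary robots.txt syntax; the live file's exact layout, casing, and comments are not shown in this report, and the `ExampleBot` agent name and the `allowed` helper are hypothetical. It uses Python's standard-library `urllib.robotparser`, which does plain prefix matching and does not expand `*` inside paths, so the wildcard rules are not exercised here.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report (an assumption:
# the real file may order or format these lines differently).
ROBOTS_TXT = """\
User-agent: magpie-crawler
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: *
Disallow: /wp-admin/
Disallow: /wp-login.php
Disallow: /uncategorized/*
Disallow: /aop/*
Disallow: /tag/*
Disallow: /?s=
Disallow: /search
"""

BASE = "https://www.themanual.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` under these rules?"""
    return parser.can_fetch(agent, BASE + path)


# Agents with their own group are blocked from the whole site.
print(allowed("GPTBot", "/"))
print(allowed("magpie-crawler", "/"))

# Any other agent falls through to the "*" group.
print(allowed("ExampleBot", "/wp-admin/"))
print(allowed("ExampleBot", "/search"))
print(allowed("ExampleBot", "/"))
```

Under these rules the first four checks should report that fetching is not allowed, and the last should report that it is. A crawler that honors wildcards would additionally need its own pattern matching for the `/uncategorized/*`, `/aop/*`, and `/tag/*` rules.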

Other Records

Field Value
sitemap https://www.themanual.com/sitemap-all-content-index.xml
sitemap https://www.themanual.com/sitemap-all-images-index.xml
sitemap https://www.themanual.com/sitemap-deals-sitemap-index.xml
sitemap https://www.themanual.com/sitemap-google-news-index.xml
sitemap https://www.themanual.com/sitemap-latest-500-index.xml
sitemap https://www.themanual.com/sitemap-news-sitemap-index.xml
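
The sitemap records above can be read back with the same standard-library parser. This is a small sketch, assuming the records appear in the file as ordinary `Sitemap:` lines (the report only shows the field name and value) and a Python version of 3.8 or later, where `RobotFileParser.site_maps()` is available.

```python
from urllib.robotparser import RobotFileParser

# Sitemap records as listed in this report, written as robots.txt lines
# (an assumption about how they appear in the file itself).
SITEMAP_LINES = [
    "Sitemap: https://www.themanual.com/sitemap-all-content-index.xml",
    "Sitemap: https://www.themanual.com/sitemap-all-images-index.xml",
    "Sitemap: https://www.themanual.com/sitemap-deals-sitemap-index.xml",
    "Sitemap: https://www.themanual.com/sitemap-google-news-index.xml",
    "Sitemap: https://www.themanual.com/sitemap-latest-500-index.xml",
    "Sitemap: https://www.themanual.com/sitemap-news-sitemap-index.xml",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns None when no sitemap records were found,
# so fall back to an empty list.
sitemaps = parser.site_maps() or []

for url in sitemaps:
    print(url)
```

Each URL is a sitemap index, so a consumer would still need to fetch and parse the index to reach the individual sitemap files.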