thetech.com
robots.txt

Robots Exclusion Standard data for thetech.com

Resource Scan

Scan Details

Site Domain thetech.com
Base Domain thetech.com
Scan Status Ok
Last Scan2024-09-27T07:14:34+00:00
Next Scan 2024-10-04T07:14:34+00:00

Last Scan

Scanned2024-09-27T07:14:34+00:00
URL https://thetech.com/robots.txt
Domain IPs 3.210.156.221
Response IP 3.210.156.221
Found Yes
Hash 88d5804402113495a95900b702ba947c44ee491e659bba7327b4febb0d11d787
SimHash 48036886eeb2

Groups

*

Rule Path
Disallow /admin/
Disallow /search/
Disallow /image_search/
Disallow /api/
Disallow /niceties-manifest/

Other Records

Field Value
sitemap http://thetech.com/news_sitemap.xml
sitemap http://thetech.com/search_sitemap.xml

Comments

  • Sitemap for archive and news