techrepublic.com
robots.txt

Robots Exclusion Standard data for techrepublic.com

Resource Scan

Scan Details

Site Domain techrepublic.com
Base Domain techrepublic.com
Scan Status Ok
Last Scan2024-11-12T19:29:03+00:00
Next Scan 2024-11-19T19:29:03+00:00

Last Scan

Scanned2024-11-12T19:29:03+00:00
URL https://techrepublic.com/robots.txt
Redirect https://www.techrepublic.com/robots.txt
Redirect Domain www.techrepublic.com
Redirect Base techrepublic.com
Domain IPs 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91
Redirect IPs 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91, 2a04:4e42:200::347, 2a04:4e42:400::347, 2a04:4e42:600::347, 2a04:4e42::347
Response IP 199.232.45.91
Found Yes
Hash a64f1b20dffd05554332f52edc7949c90f02eb4a72e84985cef27aeebe0f43ad
SimHash 4a0159210fb2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow *%26s%3D*
Disallow *%26q%3D*
Disallow */?s=*
Disallow */?q=*
Disallow /search$
Disallow /search?*
Disallow /search/
Disallow /resource/*
Disallow /resource/
Disallow /resource-library/search$
Disallow /resource-library/search?*
Disallow /resource-library/search/
Disallow /resource-library/*?*
Disallow /members/profile/
Disallow /memebers/profile/
Disallow /index.php/members/profile/
Disallow /5055/aw-techrepublic/
Disallow /5055/maw-techrepublic/

magpie-crawler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.techrepublic.com/sitemap_index.xml