uk.complex.com
robots.txt

Robots Exclusion Standard data for uk.complex.com

Resource Scan

Scan Details

Site Domain uk.complex.com
Base Domain complex.com
Scan Status Ok
Last Scan 2024-04-26T04:06:52+00:00
Next Scan 2024-05-26T04:06:52+00:00

Last Scan

Scanned 2024-04-26T04:06:52+00:00
URL https://uk.complex.com/robots.txt
Redirect https://www.complex.com/robots.txt
Redirect Domain www.complex.com
Redirect Base complex.com
Domain IPs 151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133
Redirect IPs 151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133
Response IP 199.232.46.133
Found Yes
Hash b7cee7e2a2291c8f8ea05ead3af3694128c021b833f015646af5f574ab307d2c
SimHash 292a0b64c975

Groups

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4
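
The two crawl-delay groups above can be checked with Python's standard-library parser. This is a minimal sketch: the robots.txt lines are a reconstruction from the fields in this report, not the file's verbatim text, and the `/music` path is a hypothetical example.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction of the msnbot and slurp groups from the report fields.
# The original file's exact formatting is an assumption.
ROBOTS_LINES = [
    "User-agent: msnbot",
    "Crawl-delay: 120",
    "",
    "User-agent: slurp",
    "Crawl-delay: 4",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Delay, in seconds, that each crawler is asked to wait between requests.
msnbot_delay = parser.crawl_delay("msnbot")
slurp_delay = parser.crawl_delay("slurp")

# Neither group defines Disallow rules, so any path is permitted.
# "/music" is a hypothetical path used only for illustration.
msnbot_allowed = parser.can_fetch("msnbot", "https://www.complex.com/music")

print(msnbot_delay, slurp_delay, msnbot_allowed)
```

A group that carries only a crawl-delay restricts request rate, not which paths may be fetched.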

grapeshot

Rule Path
Disallow /sitemap/

*

Rule Path
Disallow /core/
Disallow /dex/
Disallow /cgi-bin/
Disallow /includes/
Disallow /flash/
Disallow /plugins/
Disallow /system/
Disallow /widgets/
Disallow /assets/
Disallow /blogs/wp-content/
Disallow /blog-galleries/
Disallow /blogs/
Disallow /api/
Disallow /static/js/
Disallow /static/css/
Disallow /js/
Disallow /css/
Disallow /tv/
Disallow /search?*
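
The `*` group mixes plain path prefixes with one wildcard pattern (`/search?*`). The sketch below shows one way such patterns might be matched, assuming the wildcard semantics of RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end). The helper names and the sample paths are hypothetical.

```python
import re

# Disallow patterns for the "*" group, copied from the report above.
DISALLOWED = [
    "/core/", "/dex/", "/cgi-bin/", "/includes/", "/flash/", "/plugins/",
    "/system/", "/widgets/", "/assets/", "/blogs/wp-content/",
    "/blog-galleries/", "/blogs/", "/api/", "/static/js/", "/static/css/",
    "/js/", "/css/", "/tv/", "/search?*",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOWED)


# Hypothetical sample paths.
print(is_disallowed("/search?q=sneakers"))  # matched by /search?*
print(is_disallowed("/search"))             # no "?" so the pattern does not apply
print(is_disallowed("/api/v1/items"))       # matched by the /api/ prefix
```

Because this group contains only Disallow rules, no Allow/Disallow precedence logic is needed here.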

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
Allow /sitemap/

Other Records

Field Value
sitemap https://www.complex.com/sitemap/news.xml
sitemap https://www.complex.com/sitemap/index.xml
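
The gptbot group pairs `Disallow /` with `Allow /sitemap/`, so the outcome depends on how a crawler resolves overlapping rules. The sketch below assumes the longest-match rule described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The function name and the `/music/` path are hypothetical.

```python
from typing import List, Tuple

Rule = Tuple[str, str]  # (kind, path prefix), kind is "allow" or "disallow"

# Rules copied from the google-extended and gptbot groups in the report.
GOOGLE_EXTENDED_RULES: List[Rule] = [("disallow", "/")]
GPTBOT_RULES: List[Rule] = [("disallow", "/"), ("allow", "/sitemap/")]


def is_allowed(rules: List[Rule], path: str) -> bool:
    """Longest matching prefix decides; with no match the path is allowed."""
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


print(is_allowed(GPTBOT_RULES, "/sitemap/news.xml"))           # allowed
print(is_allowed(GPTBOT_RULES, "/music/"))                     # blocked
print(is_allowed(GOOGLE_EXTENDED_RULES, "/sitemap/news.xml"))  # blocked
```

Parsers that apply the first matching rule in file order, such as Python's `urllib.robotparser`, may give a different answer for `/sitemap/` paths when `Disallow /` is listed first.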