utne.com
robots.txt

Robots Exclusion Standard data for utne.com

Resource Scan

Scan Details

Site Domain utne.com
Base Domain utne.com
Scan Status Ok
Last Scan2024-11-08T15:04:35+00:00
Next Scan 2024-11-15T15:04:35+00:00

Last Scan

Scanned2024-11-08T15:04:35+00:00
URL https://utne.com/robots.txt
Redirect https://www.utne.com/robots.txt
Redirect Domain www.utne.com
Redirect Base utne.com
Domain IPs 108.157.254.129, 108.157.254.13, 108.157.254.77, 108.157.254.78
Redirect IPs 54.230.112.124, 54.230.112.17, 54.230.112.87, 54.230.112.95
Response IP 52.85.49.97
Found Yes
Hash 377ad41f34bccd63694c6d4a4e549c2c18971e60b820f7d15829d6ec72b05b59
SimHash 03a5b860cba3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /_custom/*
Disallow /*/print/
Disallow /wp-json/*
Disallow /search/*
Disallow /email
Disallow /print
Disallow /print-article.aspx
Disallow /sso/*
Disallow /store/offer/*
Disallow /store/author/*
Disallow /watch/*
Disallow /uploadedFiles/*
Disallow /tags/*
Disallow /search
Disallow /contributors/*

rogerbot

Rule Path
Allow /*
Disallow /wp-admin/
Disallow /_custom/
Disallow /wp-json/

twitterbot

Rule Path
Allow /

dotbot

Rule Path
Allow /*
Disallow /wp-admin/
Disallow /wp-json/

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.utne.com/sitemap.xml