10best.com
robots.txt

Robots Exclusion Standard data for 10best.com

Resource Scan

Scan Details

Site Domain 10best.com
Base Domain 10best.com
Scan Status Ok
Last Scan2024-11-11T01:41:06+00:00
Next Scan 2024-11-18T01:41:06+00:00

Last Scan

Scanned2024-11-11T01:41:06+00:00
URL https://www.10best.com/robots.txt
Domain IPs 151.101.130.62, 151.101.194.62, 151.101.2.62, 151.101.66.62
Response IP 199.232.46.62
Found Yes
Hash 00965791b2f9ca6199c5b7f8d8edbee7ec1e82b8675fbccf3893a4f4a781cbaa
SimHash 51181840a573

Groups

*

Rule Path
Disallow /*/iframe/
Disallow /legal/
Disallow /insider/
Disallow /status.html
Disallow /awards/upload/
Disallow /awards/nominate/

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
Allow /

Other Records

Field Value
sitemap https://www.10best.com/sitemap.xml
sitemap https://www.10best.com/sitemaps/sitemap-overview-and-cities-1.xml.gz
sitemap https://www.10best.com/sitemaps/sitemap-articles-and-galleries-1.xml.gz
sitemap https://www.10best.com/sitemaps/sitemap-restaurants-1.xml.gz
sitemap https://www.10best.com/sitemaps/sitemap-hotels-1.xml.gz
sitemap https://www.10best.com/sitemaps/sitemap-shopping-1.xml.gz
sitemap https://www.10best.com/sitemaps/sitemap-nightlife-1.xml.gz
sitemap https://www.10best.com/sitemaps/sitemap-attractions-1.xml.gz