leicestercity.news
robots.txt

Robots Exclusion Standard data for leicestercity.news

Resource Scan

Scan Details

Site Domain leicestercity.news
Base Domain leicestercity.news
Scan Status Ok
Last Scan2024-09-23T23:36:13+00:00
Next Scan 2024-09-30T23:36:13+00:00

Last Scan

Scanned2024-09-23T23:36:13+00:00
URL https://leicestercity.news/robots.txt
Redirect https://www.leicestercity.news/robots.txt
Redirect Domain www.leicestercity.news
Redirect Base leicestercity.news
Domain IPs 104.21.54.136, 172.67.138.242, 2606:4700:3031::6815:3688, 2606:4700:3031::ac43:8af2
Redirect IPs 104.21.54.136, 172.67.138.242, 2606:4700:3031::6815:3688, 2606:4700:3031::ac43:8af2
Response IP 104.21.54.136
Found Yes
Hash 5558c589c4728e19aa6e5ab16062402b94c09a842fa8de0e2a03bd4dc2879654
SimHash 3b309a042434

Groups

*

Rule Path
Disallow /core/wp-admin/
Allow /core/wp-admin/admin-ajax.php
Disallow /?s=

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.leicestercity.news/sitemap_index.xml

Comments

  • XML Sitemap & Google News version 5.3.6 - https://status301.net/wordpress-plugins/xml-sitemap-feed/
  • No XML Sitemaps are enabled on this site.
  • Block Common Crawl
  • Block Google Bard AI
  • Block Open AI