sheffieldwednesday.news
robots.txt

Robots Exclusion Standard data for sheffieldwednesday.news

Resource Scan

Scan Details

Site Domain sheffieldwednesday.news
Base Domain sheffieldwednesday.news
Scan Status Ok
Last Scan2024-11-11T05:22:26+00:00
Next Scan 2024-11-18T05:22:26+00:00

Last Scan

Scanned2024-11-11T05:22:26+00:00
URL https://sheffieldwednesday.news/robots.txt
Redirect https://www.sheffieldwednesday.news/robots.txt
Redirect Domain www.sheffieldwednesday.news
Redirect Base sheffieldwednesday.news
Domain IPs 104.26.2.209, 104.26.3.209, 172.67.71.88, 2606:4700:20::681a:2d1, 2606:4700:20::681a:3d1, 2606:4700:20::ac43:4758
Redirect IPs 104.26.2.209, 104.26.3.209, 172.67.71.88, 2606:4700:20::681a:2d1, 2606:4700:20::681a:3d1, 2606:4700:20::ac43:4758
Response IP 172.67.71.88
Found Yes
Hash bf682445df60b3f8fc9ae196767fb05c3d62a4cb3b540826e3ad84a94429d7a7
SimHash 3bb89a002430

Groups

*

Rule Path
Disallow /core/wp-admin/
Allow /core/wp-admin/admin-ajax.php
Disallow /?s=

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.sheffieldwednesday.news/sitemap_index.xml

Comments

  • XML Sitemap & Google News version 5.3.6 - https://status301.net/wordpress-plugins/xml-sitemap-feed/
  • No XML Sitemaps are enabled on this site.
  • Block Common Crawl
  • Block Google Bard AI
  • Block Open AI