blog.syracuse.com
robots.txt

Robots Exclusion Standard data for blog.syracuse.com

Resource Scan

Scan Details

Site Domain blog.syracuse.com
Base Domain syracuse.com
Scan Status Ok
Last Scan 2024-11-16T10:33:57+00:00
Next Scan 2024-11-23T10:33:57+00:00

Last Scan

Scanned 2024-11-16T10:33:57+00:00
URL https://blog.syracuse.com/robots.txt
Redirect https://www.syracuse.com/robots.txt
Redirect Domain www.syracuse.com
Redirect Base syracuse.com
Domain IPs 151.101.131.52, 151.101.195.52, 151.101.3.52, 151.101.67.52
Redirect IPs 184.87.193.77, 184.87.193.82, 2600:1413:b000:13::b857:c192, 2600:1413:b000:13::b857:c194
Response IP 23.59.80.177
Found Yes
Hash b6e0bc9b459fbfb37feb7a21a05f95ce6f16a1b76fc6abde63f8316a71539ecd
SimHash 59b36941a5e6
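
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not say how it was computed, so the sketch below only assumes a SHA-256 over the raw response body. The function names `body_digest` and `fetch_robots` are hypothetical. `urllib.request.urlopen` follows HTTP redirects by default, which mirrors the URL and Redirect fields listed above.

```python
import hashlib
import urllib.request


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body (assumed hash scheme)."""
    return hashlib.sha256(body).hexdigest()


def fetch_robots(url: str, timeout: float = 10.0) -> tuple[str, bytes]:
    """Fetch a robots.txt file, following redirects.

    Returns the final URL after any redirects and the raw body bytes.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.geturl(), response.read()
```

Calling `fetch_robots("https://blog.syracuse.com/robots.txt")` and passing the body to `body_digest` would show whether the live file still matches the recorded hash. A mismatch could mean either that the file changed or that the scanner used a different hashing scheme.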

Groups

meta-externalagent

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

*

Rule Path
Disallow /workspace-betting/*

bingbot

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /saints/
Disallow /landingpages/
Disallow /oil-spill-reports/
Disallow /premium/
Disallow /contests/scrape/
Disallow /puzzles-palace/
Disallow /toprailapp/
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/
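
Several paths in this group, such as /*/print.html* and /*mt-preview-*, contain the * wildcard. Under RFC 9309, * matches any sequence of characters, a trailing $ anchors the end of the path, and everything else is a prefix match. The following is a minimal sketch of that matching rule, not the scanner's own logic; `rule_to_regex` and `rule_matches` are hypothetical helper names.

```python
import re


def rule_to_regex(rule_path: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape literal segments and let each '*' match any run of characters.
    pattern = ".*".join(re.escape(part) for part in rule_path.split("*"))
    if anchored:
        pattern += "$"
    return re.compile(pattern)


def rule_matches(rule_path: str, url_path: str) -> bool:
    """Return True if url_path falls under rule_path (prefix match from the start)."""
    return rule_to_regex(rule_path).match(url_path) is not None


print(rule_matches("/*/print.html*", "/news/print.html"))          # True
print(rule_matches("/*/print.html*", "/print.html"))               # False
print(rule_matches("/*mt-preview-*", "/blog/mt-preview-abc.html")) # True
print(rule_matches("/comics/", "/news/comics/"))                   # False
```

The second call returns False because the pattern requires a path segment between the leading slash and /print.html.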

facebot

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /test_team/
Disallow /premium/
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/

googlebot

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /landingpages/
Disallow /oil-spill-reports/
Disallow /premium/
Disallow /contests/scrape/
Disallow /puzzles-palace/
Disallow /toprailapp/
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/

proximic

Rule Path
Disallow /home-beta/
Disallow /test/

slurp

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /landingpages/
Disallow /oil-spill-reports/
Disallow /premium/
Disallow /contests/scrape/
Disallow /puzzles-palace/
Disallow /toprailapp/
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/

twitterbot

Rule Path
Disallow /home-beta/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /landingpages/
Disallow /oil-spill-reports/
Disallow /premium/
Disallow /contests/scrape/
Disallow /puzzles-palace/
Disallow /toprailapp/
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/

*

Rule Path
Disallow /digitalsubscription/imagedownload/
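
To illustrate how the groups above apply to a given crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_EXCERPT` text is a hand-written reconstruction of three groups from this report, not the actual file, and `examplecrawler` is a made-up user agent. Two caveats: `urllib.robotparser` treats `*` inside a path literally, so the excerpt uses only plain prefix rules, and it keeps only the first `*` group it encounters, so the excerpt contains a single one.

```python
from urllib.robotparser import RobotFileParser

# Hand-written excerpt based on the groups listed in this report (assumption:
# the real file's layout and remaining rules differ from this short sample).
ROBOTS_EXCERPT = """\
User-agent: gptbot
Disallow: /

User-agent: googlebot
Disallow: /home-beta/
Disallow: /test/
Disallow: /premium/

User-agent: *
Disallow: /digitalsubscription/imagedownload/
"""

BASE = "https://www.syracuse.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_EXCERPT)

# gptbot is blocked from every path by "Disallow: /".
print(parser.can_fetch("gptbot", BASE + "/news/"))             # False
# googlebot is blocked only from the listed prefixes.
print(parser.can_fetch("googlebot", BASE + "/test/page.html")) # False
print(parser.can_fetch("googlebot", BASE + "/news/"))          # True
# Any other agent falls back to the "*" group.
print(parser.can_fetch("examplecrawler", BASE + "/digitalsubscription/imagedownload/a.jpg"))  # False
print(parser.can_fetch("examplecrawler", BASE + "/news/"))     # True
```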

Other Records

Field Value
sitemap https://www.syracuse.com/arc/outboundfeeds/rss-latest/?outputType=xml
sitemap https://www.syracuse.com/arc/outboundfeeds/sitemap-index/?outputType=xml
sitemap https://www.syracuse.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml
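
The sitemap records above can be read back out of a robots.txt file with `RobotFileParser.site_maps()`, available in Python 3.8 and later. This sketch assumes the records appear in the file as standard `Sitemap:` lines; `SITEMAP_LINES` and `list_sitemaps` are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# The three sitemap records from this report, written as robots.txt lines.
SITEMAP_LINES = [
    "Sitemap: https://www.syracuse.com/arc/outboundfeeds/rss-latest/?outputType=xml",
    "Sitemap: https://www.syracuse.com/arc/outboundfeeds/sitemap-index/?outputType=xml",
    "Sitemap: https://www.syracuse.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml",
]


def list_sitemaps(lines: list[str]) -> list[str]:
    """Return the sitemap URLs declared in the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap lines are present.
    return parser.site_maps() or []


for url in list_sitemaps(SITEMAP_LINES):
    print(url)
```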