pennlive.com
robots.txt

Robots Exclusion Standard data for pennlive.com

Resource Scan

Scan Details

Site Domain pennlive.com
Base Domain pennlive.com
Scan Status Ok
Last Scan2024-06-22T18:08:44+00:00
Next Scan 2024-06-29T18:08:44+00:00

Last Scan

Scanned2024-06-22T18:08:44+00:00
URL https://pennlive.com/robots.txt
Redirect https://www.pennlive.com:443/robots.txt
Redirect Domain www.pennlive.com
Redirect Base pennlive.com
Domain IPs 75.2.53.215, 99.83.138.34
Redirect IPs 23.45.207.209, 23.45.207.210, 2600:1413:b000:14::b857:c147, 2600:1413:b000:14::b857:c14d
Response IP 42.99.140.194
Found Yes
Hash 26023672643b30e941c54183b7333aedb2f40d4cb8d9c573f22591f38aa23bf2
SimHash 4231cb41a774

Groups

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /landingpages/
Disallow /oil-spill-reports/
Disallow /premium/
Disallow /contests/scrape/
Disallow /celebrity-news/
Disallow /comics-kingdom/
Disallow /puzzles-games/
Disallow /wedding-stories/
Disallow /mattress/
Disallow /prudential/
Disallow /mid-penn-awnings/
Disallow /us-politics/
Disallow /living/dash/
Disallow /letters/
Disallow /silive_sandbox_blog_-_azywusko
Disallow /toprailapp/
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/

googlebot

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /landingpages/
Disallow /oil-spill-reports/
Disallow /premium/
Disallow /contests/scrape/
Disallow /celebrity-news/
Disallow /comics-kingdom/
Disallow /puzzles-games/
Disallow /wedding-stories/
Disallow /mattress/
Disallow /prudential/
Disallow /mid-penn-awnings/
Disallow /us-politics/
Disallow /living/dash/
Disallow /letters/
Disallow /puzzles-palace/
Disallow /silive_sandbox_blog_-_azywusko
Disallow /toprailapp/
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/

proximic

Rule Path
Disallow /home-beta/
Disallow /test/

slurp

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /landingpages/
Disallow /oil-spill-reports/
Disallow /premium/
Disallow /contests/scrape/
Disallow /celebrity-news/
Disallow /comics-kingdom/
Disallow /puzzles-games/
Disallow /wedding-stories/
Disallow /mattress/
Disallow /prudential/
Disallow /mid-penn-awnings/
Disallow /us-politics/
Disallow /living/dash/
Disallow /letters/
Disallow /puzzles-palace/
Disallow /toprailapp/
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/

twitterbot

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /landingpages/
Disallow /oil-spill-reports/
Disallow /premium/
Disallow /contests/scrape/
Disallow /celebrity-news/
Disallow /comics-kingdom/
Disallow /puzzles-games/
Disallow /wedding-stories/
Disallow /mattress/
Disallow /prudential/
Disallow /mid-penn-awnings/
Disallow /us-politics/
Disallow /living/dash/
Disallow /letters/
Disallow /puzzles-palace/
Disallow /toprailapp/
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/

*

Rule Path
Disallow /digitalsubscription/imagedownload/
Disallow /workspace-betting/*

Other Records

Field Value
sitemap https://www.pennlive.com/arc/outboundfeeds/rss-latest/?outputType=xml
sitemap https://www.pennlive.com/arc/outboundfeeds/sitemap-index/?outputType=xml
sitemap https://www.pennlive.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml