theslicedpan.com
robots.txt

Robots Exclusion Standard data for theslicedpan.com

Resource Scan

Scan Details

Site Domain theslicedpan.com
Base Domain theslicedpan.com
Scan Status Ok
Last Scan2025-09-07T13:18:08+00:00
Next Scan 2025-10-07T13:18:08+00:00

Last Scan

Scanned2025-09-07T13:18:08+00:00
URL https://theslicedpan.com/robots.txt
Redirect https://www.theslicedpan.com/robots.txt
Redirect Domain www.theslicedpan.com
Redirect Base theslicedpan.com
Domain IPs 104.21.30.235, 172.67.174.44, 2606:4700:3034::ac43:ae2c, 2606:4700:3036::6815:1eeb
Redirect IPs 104.21.30.235, 172.67.174.44, 2606:4700:3034::ac43:ae2c, 2606:4700:3036::6815:1eeb
Response IP 104.21.30.235
Found Yes
Hash fafec25766f1f8213f7208239463724dbd677246e4ce14726ffa602d2cfc6d32
SimHash 2051d7025597

Groups

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

teoma

Rule Path
Disallow

gigabot

Rule Path
Disallow

robozilla

Rule Path
Disallow

nutch

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

baiduspider

Rule Path
Disallow

naverbot

Rule Path
Disallow

yeti

Rule Path
Disallow

yahoo-mmcrawler

Rule Path
Disallow

psbot

Rule Path
Disallow

yahoo-blogs/v3.9

Rule Path
Disallow

*

Rule Path
Disallow
Disallow /cgi-bin/

Other Records

Field Value
sitemap https://www.theslicedpan.com/sitemap.xml

Warnings

  • 1 invalid line.