e-yearbook.com
robots.txt

Robots Exclusion Standard data for e-yearbook.com

Resource Scan

Scan Details

Site Domain e-yearbook.com
Base Domain e-yearbook.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-10-08T02:24:45+00:00
Next Scan 2024-10-15T02:24:45+00:00

Last Successful Scan

Scanned2024-09-30T02:11:01+00:00
URL https://e-yearbook.com/robots.txt
Redirect https://www.e-yearbook.com/robots.txt
Redirect Domain www.e-yearbook.com
Redirect Base e-yearbook.com
Domain IPs 67.23.57.201
Redirect IPs 67.23.57.201
Response IP 67.23.57.201
Found Yes
Hash 1e09030e3ed04fbc659e6ee6b41088f7abac1e91a24953cd9ed254921825f010
SimHash 511c9b616f81

Groups

*

Rule Path
Disallow /tmp/
Disallow /scripts/
Disallow /sp/search
Disallow /sp/eybs

meta-externalagent

Rule Path
Disallow /books/

imagesiftbot
applebot

Rule Path
Disallow /books/

amazonbot

Rule Path
Disallow /books/

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

phxbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.e-yearbook.com/sitemap_index.xml
sitemap https://www.e-yearbook.com/sitemap_sum_index.xml

Comments

  • e-YearBook.com robots.txt
  • User-agent: Googlebot
  • Allow: /books/*/*/*/1.jpg$
  • Disallow: /books/
  • User-agent: Googlebot-image
  • Allow: /books/*/*/*/1.jpg$
  • Disallow: /books/