marymartin.com
robots.txt

Robots Exclusion Standard data for marymartin.com

Resource Scan

Scan Details

Site Domain marymartin.com
Base Domain marymartin.com
Scan Status Ok
Last Scan 2026-02-18T03:41:50+00:00
Next Scan 2026-02-25T03:41:50+00:00

Last Scan

Scanned 2026-02-18T03:41:50+00:00
URL https://www.marymartin.com/robots.txt
Domain IPs 38.247.130.9
Response IP 38.247.130.9
Found Yes
Hash 178cc0716c8494582b518a1fe965f9aabf008436ddfe61a8bff24207b913d95a
SimHash 8176d9698673

Groups

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Allow /ads.txt
Allow /app-ads.txt
Disallow /marc/
Disallow /marc/aboutus
Disallow /bo/
Disallow /PaypalRedirect/
Disallow /web/jsp/tagCloud/selectedIndex?
Disallow /nlb/
Disallow /web/selectedCatalog/
Disallow /web/searchIndex/
Disallow /web/catalogByCountry/
Disallow /web/mmbs/CatalogHtmls/
Disallow /web/mmbs/CatalogHtmls/viewbooks?
Disallow /web/mmbs/CatalogHtmls/imageAction?
Disallow /web/mmbs/CatalogHtmls/exportInExcel?
Disallow /web/jsp/tagCloud/viewbooks?
Disallow /web/jsp/tagCloud/imageAction?
Disallow /web/jsp/tagCloud/selectedCatalog?
Disallow /web/jsp/tagCloud/

Other Records

Field Value
crawl-delay 2
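
Example

The groups and records above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser. It is not the site's actual file: ROBOTS_TXT is a hand-written reconstruction of a subset of the scanned rules, the user-agent capitalization is assumed, placing Crawl-delay inside the "*" group is an assumption (the scan lists it separately under Other Records), and "ExampleBot" is a hypothetical crawler name used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of a subset of the scanned rules. The real file's
# ordering, capitalization, and Crawl-delay placement may differ.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: *
Allow: /ads.txt
Allow: /app-ads.txt
Disallow: /marc/
Disallow: /bo/
Disallow: /web/searchIndex/
Crawl-delay: 2
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text from a string, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "https://www.marymartin.com"

# Crawlers named in their own group are blocked from the whole site.
gptbot_home = parser.can_fetch("GPTBot", base + "/")
ccbot_home = parser.can_fetch("CCBot", base + "/")

# "ExampleBot" (hypothetical) matches no named group, so "*" applies.
generic_home = parser.can_fetch("ExampleBot", base + "/")
generic_ads = parser.can_fetch("ExampleBot", base + "/ads.txt")
generic_marc = parser.can_fetch("ExampleBot", base + "/marc/aboutus")

# Crawl delay in seconds for the fallback group, under the placement
# assumption stated above.
generic_delay = parser.crawl_delay("ExampleBot")

print(gptbot_home, ccbot_home, generic_home, generic_ads, generic_marc, generic_delay)
```

Note that urllib.robotparser applies the first matching rule in file order and does plain prefix matching on paths, so its verdicts can differ from crawlers that use longest-match precedence or wildcard support.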

Comments

  • No ChatGPT user allowed anywhere on the site
  • No Google Bard allowed anywhere on the site
  • No Common Crawl Bot