longreads.com
robots.txt

Robots Exclusion Standard data for longreads.com

Resource Scan

Scan Details

Site Domain longreads.com
Base Domain longreads.com
Scan Status Ok
Last Scan2024-06-13T11:17:50+00:00
Next Scan 2024-06-20T11:17:50+00:00

Last Scan

Scanned2024-06-13T11:17:50+00:00
URL https://longreads.com/robots.txt
Domain IPs 192.0.79.32, 192.0.79.33
Response IP 192.0.79.32
Found Yes
Hash ac28d29159ccb8e18b498dde9932e871fd4cf07f799b8b476920e9f893c96dde
SimHash 09a4d840a823

Groups

*

Rule Path
Disallow

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://longreads.com/sitemap.xml
sitemap https://longreads.com/news-sitemap.xml
sitemap https://longreads.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • OpenAI crawler
  • ChatGPT service
  • Common Crawl crawler
  • Bard/Gemini service
  • Imagesift by Hive
  • ---------------------------
  • END YOAST BLOCK