interestingengineering.com
robots.txt

Robots Exclusion Standard data for interestingengineering.com

Resource Scan

Scan Details

Site Domain interestingengineering.com
Base Domain interestingengineering.com
Scan Status Ok
Last Scan 2024-06-08T10:32:57+00:00
Next Scan 2024-06-15T10:32:57+00:00

Last Scan

Scanned 2024-06-08T10:32:57+00:00
URL https://interestingengineering.com/robots.txt
Domain IPs 104.26.14.179, 104.26.15.179, 172.67.75.65, 2606:4700:20::681a:eb3, 2606:4700:20::681a:fb3, 2606:4700:20::ac43:4b41
Response IP 104.26.15.179
Found Yes
Hash d1c280c70ad011a969aad2833b66a04142b8de8b9c4898758b594b619aa9c539
SimHash 498c4132a751

Groups

googlebot-news

Rule Path
Allow /

twitterbot

Rule Path
Disallow (empty)

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Disallow /s/*
Disallow /redir/*
Disallow /newsletter/*
Disallow /partial/*
Disallow /*?context_item_id

Other Records

Field Value
sitemap https://interestingengineering.com/sitemap_index.xml
sitemap https://interestingengineering.com/news-sitemap.xml

Warnings

  • 1 invalid line.
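
The groups listed above can be checked programmatically. The following is a minimal sketch, not the scanner's own method: it assumes the file can be reconstructed from the report as shown in ROBOTS_TXT (the original casing, ordering, comments, and the one invalid line are not known), and it uses Python's standard urllib.robotparser. The helper names build_parser and is_allowed are hypothetical. As far as I know, the standard-library parser does plain prefix matching and does not implement the `*` path wildcards used in the catch-all group, so only the per-agent rules are exercised here.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: reconstructed from the scan report, not the literal file.
# The report lowercases agent names and omits the one invalid line.
ROBOTS_TXT = """\
User-agent: googlebot-news
Allow: /

User-agent: twitterbot
Disallow:

User-agent: gptbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: *
Disallow: /s/*
Disallow: /redir/*
Disallow: /newsletter/*
Disallow: /partial/*
Disallow: /*?context_item_id

Sitemap: https://interestingengineering.com/sitemap_index.xml
Sitemap: https://interestingengineering.com/news-sitemap.xml
"""


def build_parser(text=ROBOTS_TXT):
    """Parse robots.txt text offline (no network request is made)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(agent, url, parser=None):
    """Return True if `agent` may fetch `url` under the parsed rules."""
    if parser is None:
        parser = build_parser()
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    home = "https://interestingengineering.com/"
    for agent in ("Googlebot-News", "Twitterbot", "GPTBot",
                  "ChatGPT-User", "Google-Extended", "CCBot"):
        verdict = "allowed" if is_allowed(agent, home) else "blocked"
        print(f"{agent}: {verdict}")
```

The empty Disallow in the twitterbot group blocks nothing, so that agent is treated as fully allowed, while the four AI-related agents are blocked site-wide. To evaluate the wildcard rules in the `*` group, a parser that implements RFC 9309 pattern matching would be needed.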