cdn.insertlive.com
robots.txt

Robots Exclusion Standard data for cdn.insertlive.com

Resource Scan

Scan Details

Site Domain cdn.insertlive.com
Base Domain insertlive.com
Scan Status Ok
Last Scan2024-06-18T15:21:25+00:00
Next Scan 2024-07-18T15:21:25+00:00

Last Scan

Scanned2024-06-18T15:21:25+00:00
URL https://cdn.insertlive.com/robots.txt
Domain IPs 103.49.221.172, 203.190.242.172
Response IP 103.49.221.172
Found Yes
Hash caa8650824e037e531b1f306afb15b083b5bafe7bd9aff6eff8ffa43297b083e
SimHash 79309f51b9b2

Groups

ahrefsbots

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

googlebot

Rule Path
Disallow /embed/
Disallow */logout
Disallow /api$
Disallow /api/
Disallow /search

chatgpt-user

Rule Path
Disallow /

openai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.insertlive.com/sitemap.xml