receitas.globo.com
robots.txt

Robots Exclusion Standard data for receitas.globo.com

Resource Scan

Scan Details

Site Domain receitas.globo.com
Base Domain globo.com
Scan Status Ok
Last Scan2024-06-27T12:41:39+00:00
Next Scan 2024-07-11T12:41:39+00:00

Last Scan

Scanned2024-06-27T12:41:39+00:00
URL https://receitas.globo.com/robots.txt
Domain IPs 186.192.81.228
Response IP 186.192.81.228
Found Yes
Hash eaed2f4f35503e028e28a145a693e1fffef04f5cfee44711dc17b2329d0c67df
SimHash 2844d844a153

Groups

*

Rule Path
Disallow /busca/*

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://receitas.globo.com/sitemap/receitas/sitemap.xml