uol.com
robots.txt

Robots Exclusion Standard data for uol.com

Resource Scan

Scan Details

Site Domain uol.com
Base Domain uol.com
Scan Status Ok
Last Scan2024-05-23T07:49:34+00:00
Next Scan 2024-05-30T07:49:34+00:00

Last Scan

Scanned2024-05-23T07:49:34+00:00
URL https://uol.com/robots.txt
Redirect https://www.uol.com.br/robots.txt
Redirect Domain www.uol.com.br
Redirect Base uol.com.br
Domain IPs 18.155.68.128, 18.155.68.33, 18.155.68.93, 18.155.68.97
Redirect IPs 13.33.30.110, 13.33.30.113, 13.33.30.17, 13.33.30.29, 2600:9000:229f:4200:1:5a19:8b40:93a1, 2600:9000:229f:5400:1:5a19:8b40:93a1, 2600:9000:229f:7a00:1:5a19:8b40:93a1, 2600:9000:229f:9a00:1:5a19:8b40:93a1, 2600:9000:229f:ae00:1:5a19:8b40:93a1, 2600:9000:229f:b600:1:5a19:8b40:93a1, 2600:9000:229f:ec00:1:5a19:8b40:93a1, 2600:9000:229f:f600:1:5a19:8b40:93a1
Response IP 184.27.123.40
Found Yes
Hash c7a972863537e7dd6896dbca0ab226a025f78497c972d6edc10f7b3acfdfbdb4
SimHash ed049a616713

Groups

*

Rule Path
Allow /
Disallow /carros/dev/

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.uol.com.br/carros/sitemap/v2/news-01.xml
sitemap https://www.uol.com.br/ecoa/sitemap/news-01.xml
sitemap https://www.uol.com.br/esporte/sitemap/v2/news-01.xml
sitemap https://www.uol.com.br/nossa/sitemap/news-01.xml
sitemap https://www.uol.com.br/splash/sitemap/news-01.xml
sitemap https://www.uol.com.br/tilt/sitemap/news-01.xml
sitemap https://www.uol.com.br/universa/sitemap/v2/news-01.xml
sitemap https://www.uol.com.br/vivabem/sitemap/v2/news-01.xml

Comments

  • robots.txt