jornadageek.com.br
robots.txt

Robots Exclusion Standard data for jornadageek.com.br

Resource Scan

Scan Details

Site Domain jornadageek.com.br
Base Domain jornadageek.com.br
Scan Status Ok
Last Scan2024-09-25T01:17:08+00:00
Next Scan 2024-10-02T01:17:08+00:00

Last Scan

Scanned2024-09-25T01:17:08+00:00
URL https://jornadageek.com.br/robots.txt
Redirect https://jornadageek.ig.com.br/robots.txt
Redirect Domain jornadageek.ig.com.br
Redirect Base ig.com.br
Domain IPs 104.21.77.235, 172.67.212.149, 2606:4700:3032::ac43:d495, 2606:4700:3035::6815:4deb
Redirect IPs 104.18.28.20, 104.18.29.20, 2606:4700::6812:1c14, 2606:4700::6812:1d14
Response IP 104.18.28.20
Found Yes
Hash 94561997c5504ac433111c4f7baae8a426bd80e12351c581870813be2c47e91b
SimHash cc69fd554fb9

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /feed/
Disallow */feed
Disallow */feed$
Disallow /feed/$
Disallow /comments/feed
Disallow /?feed=
Disallow /wp-feed
Disallow /category/
Disallow /tag/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://jornadageek.ig.com.br/sitemap_index.xml
sitemap https://jornadageek.ig.com.br/news-sitemap.xml
sitemap https://jornadageek.ig.com.br/video-sitemap.xml
sitemap https://jornadageek.ig.com.br/web-story-sitemap.xml