web3cafe.in
robots.txt

Robots Exclusion Standard data for web3cafe.in

Resource Scan

Scan Details

Site Domain web3cafe.in
Base Domain web3cafe.in
Scan Status Ok
Last Scan2024-05-28T02:37:53+00:00
Next Scan 2024-06-04T02:37:53+00:00

Last Scan

Scanned2024-05-28T02:37:53+00:00
URL https://web3cafe.in/robots.txt
Redirect https://www.web3cafe.in/robots.txt
Redirect Domain www.web3cafe.in
Redirect Base web3cafe.in
Domain IPs 23.45.207.202, 23.45.207.209, 2600:1413:b000:13::b857:c18a, 2600:1413:b000:13::b857:c19c
Redirect IPs 23.54.118.74, 23.54.118.76, 2600:1413:b000:13::b857:c18a, 2600:1413:b000:13::b857:c19c
Response IP 23.52.171.137
Found Yes
Hash add7c4227473bc9407fef34f8394cb99397df929671612d9eef118a6af0dde2c
SimHash 8b0e42168cb0

Groups

*

Rule Path
Allow /
Disallow /topic/*
Disallow /visualstories/preview.php?*

googlebot-news

Rule Path
Disallow /visualstories/preview.php?*

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.web3cafe.in/rssfeeds/sitemap.xml
sitemap https://www.web3cafe.in/rssfeeds/news-sitemap.xml
sitemap https://www.web3cafe.in/rssfeeds/date-wise-stories-sitemap.xml
sitemap https://www.web3cafe.in/visualstories/webstories-sitemap.xml
sitemap https://www.web3cafe.in/visualstories/sitemap.xml