actu.cafe
robots.txt

Robots Exclusion Standard data for actu.cafe

Resource Scan

Scan Details

Site Domain actu.cafe
Base Domain actu.cafe
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-07-07T14:49:47+00:00
Next Scan 2024-10-05T14:49:47+00:00

Last Successful Scan

Scanned 2022-07-10T14:22:59+00:00
URL https://actu.cafe/robots.txt
Redirect https://actudotcafe.wordpress.com/robots.txt
Redirect Domain actudotcafe.wordpress.com
Redirect Base wordpress.com
Response IP 192.0.78.13
Found Yes
Hash 8c006979ad8e104d96d885f9b0c4be510ac852258eea831c54053fb54cd7c950
SimHash b3169ace3cb2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-login.php
Disallow /wp-signup.php
Disallow /press-this.php
Disallow /remote-login.php
Disallow /activate/
Disallow /cgi-bin/
Disallow /mshots/v1/
Disallow /next/
Disallow /public.api/
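
The Allow entry for /wp-admin/admin-ajax.php sits inside the disallowed /wp-admin/ tree, so the outcome depends on how a crawler resolves overlapping rules. The sketch below is one way to evaluate the rules listed above, assuming the longest-match convention (the most specific matching path wins, and Allow wins a tie) and plain prefix matching with no wildcards. The names RULES and is_allowed are hypothetical and are not part of the scanned file or of any library.

```python
# Rules of the "*" group, copied from the table above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-signup.php"),
    ("disallow", "/press-this.php"),
    ("disallow", "/remote-login.php"),
    ("disallow", "/activate/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/mshots/v1/"),
    ("disallow", "/next/"),
    ("disallow", "/public.api/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest matching prefix wins, Allow wins ties,
    and a path matched by no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        n = len(prefix)
        if n > best_len or (n == best_len and kind == "allow"):
            best_len = n
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/about/"):
        print(p, is_allowed(p))
```

Under these assumptions, /wp-admin/admin-ajax.php is crawlable while the rest of /wp-admin/ is not. A parser that applies the first matching rule in file order instead would block admin-ajax.php as well, because the Disallow line comes first.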

Other Records

Field Value
sitemap https://actudotcafe.wordpress.com/sitemap.xml
sitemap https://actudotcafe.wordpress.com/news-sitemap.xml

Comments

  • If you are regularly crawling WordPress.com sites, please use our firehose to receive real-time push updates instead.
  • Please see https://developer.wordpress.com/docs/firehose/ for more details.
  • This file was generated on Sat, 04 Sep 2021 19:40:16 +0000