# Sitemap # ----------------------- # Sitemap: http://laguardiadejaen.com/sitemap/ # DENEGANDO ACCESOS POR ROBOTS: # ----------------------- # # Lista de bots que suelen respetar el robots.txt pero rara # vez hacen un buen uso del sitio y abusan bastante... # Aņadir al gusto del consumidor... # Gracias a Sigt: http://sigt.net/archivo/robotstxt-para-wordpress.xhtml User-agent: Yandex Disallow: / # Google AdSense (No tenemos Adsense, fuera.) User-agent: Mediapartners-Google Disallow: / # Google Images User-agent: Googlebot-Image Disallow: / # Google Ads User-agent: Adsbot-Google Disallow: / # Internet Archiver Wayback Machine User-agent: ia_archiver Disallow: / # digg mirror User-agent: duggmirror Disallow: / User-agent: MSIECrawler Disallow: / User-agent: WebCopier Disallow: / User-agent: HTTrack Disallow: / User-agent: Microsoft.URL.Control Disallow: / User-agent: libwww Disallow: / User-agent: Baiduspider Disallow: / User-agent: Speedy Spider Disallow: / User-agent: Sogou web spider Disallow: / User-agent: Sosospider Disallow: / User-agent: Java Disallow: / User-agent: Test.Buzzz Disallow: / User-agent: Linguee Bot Disallow: / # Ralentizando algunos bots raros. User-agent: noxtrumbot Crawl-delay: 50 User-agent: msnbot Crawl-delay: 30 User-agent: Slurp Crawl-delay: 10 # DENEGANDO ACCESOS POR URLS (absolutas): # ----------------------- # User-agent: * Disallow: http://laguardiadejaen.com/cgi-bin Disallow: http://laguardiadejaen.com/multimedia Disallow: http://laguardiadejaen.com/static Disallow: http://static.laguardiadejaen.com/ Disallow: http://laguardiadejaen.com/sandbox Disallow: http://sandbox.laguardiadejaen.com # Proyectos propios Home: Disallow: http://laguardiadejaen.com/guardiapedia Disallow: http://laguardiadejaen.com/foro Disallow: http://laguardiadejaen.com/foro-laguardia # Proyectos propios Web: Disallow: http://laguardiadejaen.com/web/apps Disallow: http://laguardiadejaen.com/web/cache Disallow: http://laguardiadejaen.com/web/7maravillas Disallow: http://laguardiadejaen.com/web/foro Disallow: http://laguardiadejaen.com/web/foro-laguardia # Web (Wordpress): Disallow: http://laguardiadejaen.com/coreweb Disallow: http://laguardiadejaen.com/web/wp-admin Disallow: http://laguardiadejaen.com/web/wp-content Disallow: http://laguardiadejaen.com/web/wp-includes ## Asegurar: Disallow: http://laguardiadejaen.com/web/wp-content/plugins Disallow: http://laguardiadejaen.com/web/wp-content/themes Disallow: http://laguardiadejaen.com/web/wp-content/upgrade Disallow: http://laguardiadejaen.com/web/wp-content/languages ## Rewrite de Wordpress, impedir: ## No permitir sin permalinks (esto deberia bastar para las siguientes): Disallow: http://laguardiadejaen.com/web/?* # Mas dinamicas: Disallow: http://laguardiadejaen.com/web/?p=* Disallow: http://laguardiadejaen.com/web/?s=* Disallow: http://laguardiadejaen.com/web/?dl_id=* # Con permalinks: Disallow: http://laguardiadejaen.com/web/search/* Disallow: http://laguardiadejaen.com/web/buscar/* Disallow: http://laguardiadejaen.com/web/usuario/* Disallow: http://laguardiadejaen.com/web/trackback/* Disallow: http://laguardiadejaen.com/web/feed/* Disallow: http://laguardiadejaen.com/web/comments/* Disallow: http://laguardiadejaen.com/web/comentarios/* # Asegurar: Disallow: */search Disallow: */buscar Disallow: */trackback Disallow: */feed Disallow: */author Disallow: */usuario Disallow: */comments Disallow: */comentarios # Tipos de ficheros # ----------------------- # ## JavaScript Disallow: /*.js$ ## CSS Disallow: /*.css$ ## Perl Disallow: /*.pl$ ## Perl Disallow: /*.ini$ ## Perl Disallow: /*.htaccess$ ## Perl Disallow: /*.bak$ ## Perl Disallow: /*.phps$