Index | index by Group | index by Distribution | index by Vendor | index by creation date | index by Name | Mirrors | Help | Search |
Name: perl-WWW-RobotRules | Distribution: openSUSE Tumbleweed |
Version: 6.02 | Vendor: openSUSE |
Release: 9.12 | Build date: Mon Feb 20 11:48:55 2012 |
Group: Development/Libraries/Perl | Build host: reproducible |
Size: 24917 | Source RPM: perl-WWW-RobotRules-6.02-9.12.src.rpm |
Packager: http://bugs.opensuse.org | |
Url: http://search.cpan.org/dist/WWW-RobotRules/ | |
Summary: database of robots.txt-derived permissions |
This module parses _/robots.txt_ files as specified in "A Standard for Robot Exclusion", at <http://www.robotstxt.org/wc/norobots.html> Webmasters can use the _/robots.txt_ file to forbid conforming robots from accessing parts of their web site. The parsed files are kept in a WWW::RobotRules object, and this object provides methods to check if access to a given URL is prohibited. The same WWW::RobotRules object can be used for one or more parsed _/robots.txt_ files on any number of hosts. The following methods are provided: * $rules = WWW::RobotRules->new($robot_name) This is the constructor for WWW::RobotRules objects. The first argument given to new() is the name of the robot. * $rules->parse($robot_txt_url, $content, $fresh_until) The parse() method takes as arguments the URL that was used to retrieve the _/robots.txt_ file, and the contents of the file. * $rules->allowed($uri) Returns TRUE if this robot is allowed to retrieve this URL. * $rules->agent([$name]) Get/set the agent name. NOTE: Changing the agent name will clear the robots.txt rules and expire times out of the cache.
Artistic-1.0 or GPL-1.0+
* Mon Feb 20 2012 coolo@suse.com - updated to 6.02 * Restore perl-5.8.1 compatiblity. * Mon Mar 14 2011 vcizek@novell.com - initial package 6.01 * created by cpanspec 1.78.03
/usr/lib/perl5/vendor_perl/5.42.0/WWW /usr/lib/perl5/vendor_perl/5.42.0/WWW/RobotRules /usr/lib/perl5/vendor_perl/5.42.0/WWW/RobotRules.pm /usr/lib/perl5/vendor_perl/5.42.0/WWW/RobotRules/AnyDBM_File.pm /usr/share/doc/packages/perl-WWW-RobotRules /usr/share/doc/packages/perl-WWW-RobotRules/Changes /usr/share/doc/packages/perl-WWW-RobotRules/README /usr/share/man/man3/WWW::RobotRules.3pm.gz /usr/share/man/man3/WWW::RobotRules::AnyDBM_File.3pm.gz
Generated by rpm2html 1.8.1
Fabrice Bellet, Thu Oct 23 22:37:43 2025