Wget Git mirror
Go to file
Matthew White 8aca8fc80d Bugfix: Process Metalink/XML url strings containing white spaces and CRLF
* src/metalink.h: Add declaration of function clean_metalink_string()
* src/metalink.c: Add directive #include "xmemdup0.h"
* src/metalink.c: Add function clean_metalink_string() remove leading
  and trailing white spaces and CRLF from string
* src/metalink.c (retrieve_from_metalink): Remove leading and trailing
  white spaces and CRLF from url resource mres->url
* testenv/Makefile.am: Add new file
* testenv/Test-metalink-xml-urlbreak.py: New test. Metalink/XML white
  spaces and CRLF in url resources tests

White spaces and CRLF are not automatically removed by libmetalink
from url strings. The Wget's Metalink module was unable to process
such url strings. This patch implements the processing of such url
strings cleaning off leading and trailing white spaces and CRLF.

If a parsed Metalink/XML url string contains strings separated by
CRLF, only the first of the series is accepted.
2016-09-30 19:44:05 +02:00
build-aux Fix missing extern declaration error for build_info.pl 2014-11-22 17:26:06 +05:30
contrib Faster Travis Builds 2015-12-10 22:43:31 +01:00
doc New: Metalink/XML and Metalink/HTTP file naming safety rules 2016-09-30 19:44:05 +02:00
gnulib@097a5928a7 Update gnulib and bootstrap 2016-06-14 09:27:58 +02:00
m4 Fix autoconf warning 2012-09-30 17:48:13 +02:00
msdos maint: update copyright year ranges to include 2015 2015-03-09 16:32:01 +01:00
po Metalink support. 2015-07-20 15:30:39 +02:00
src Bugfix: Process Metalink/XML url strings containing white spaces and CRLF 2016-09-30 19:44:05 +02:00
testenv Bugfix: Process Metalink/XML url strings containing white spaces and CRLF 2016-09-30 19:44:05 +02:00
tests Fix Test-iri-forced-remote 2015-12-20 21:32:06 +01:00
util paramcheck: use explicit quoting for here-docs 2015-05-04 10:13:24 +05:30
vms Do not depend on always defined macros 2014-06-12 18:49:15 +02:00
.gitignore Remove ABOUT-NLS from git 2015-12-11 12:11:55 +01:00
.gitmodules gnulib: add as a git submodule 2013-12-22 14:12:05 +01:00
.travis.yml Add bug-wget to list of travis recipients 2015-12-09 09:28:58 +01:00
.x-sc_po_check [mq]: cfg-mk 2009-09-21 20:39:44 -07:00
.x-sc_trailing_blank [mq]: cfg-mk 2009-09-21 20:39:44 -07:00
AUTHORS Fix typo. 2010-09-07 18:05:19 +02:00
bootstrap Update gnulib and bootstrap 2016-06-14 09:27:58 +02:00
bootstrap.conf Code cleanup for --use-askpass 2016-09-08 09:07:32 +02:00
cfg.mk Fix some make syntax-check issues 2016-08-21 15:35:36 +02:00
ChangeLog Revert "Print the fingerprint instead of the raw pointer in debugging message" 2016-03-29 15:07:29 +02:00
ChangeLog-2014-12-10 Fix typos 2016-07-02 19:01:24 +02:00
configure.ac Improve PSL cookie checking 2016-08-17 16:32:26 +02:00
COPYING Fix copyright year. 2011-05-19 10:34:26 +02:00
MAILING-LIST Remove pointer to unused Mailing List 2016-02-13 01:39:03 +01:00
Makefile.am * Makefile.am: Remove trailing empty line 2016-09-08 14:20:33 +02:00
NEWS New: Metalink/XML and Metalink/HTTP file naming safety rules 2016-09-30 19:44:05 +02:00
README Remove MAINTAINER from the README file. 2010-07-12 13:56:58 +02:00
README.checkout Explicitely mention Python3 in README.checkout 2016-06-10 15:14:43 +02:00

                                                          -*- text -*-
GNU Wget
========
                  Current Web home: http://www.gnu.org/software/wget/

GNU Wget is a free utility for non-interactive download of files from
the Web.  It supports HTTP, HTTPS, and FTP protocols, as well as
retrieval through HTTP proxies.

It can follow links in HTML pages and create local versions of remote
web sites, fully recreating the directory structure of the original
site.  This is sometimes referred to as "recursive downloading."
While doing that, Wget respects the Robot Exclusion Standard
(/robots.txt).  Wget can be instructed to convert the links in
downloaded HTML files to the local files for offline viewing.

Recursive downloading also works with FTP, where Wget can retrieves a
hierarchy of directories and files.

With both HTTP and FTP, Wget can check whether a remote file has
changed on the server since the previous run, and only download the
newer files.

Wget has been designed for robustness over slow or unstable network
connections; if a download fails due to a network problem, it will
keep retrying until the whole file has been retrieved.  If the server
supports regetting, it will instruct the server to continue the
download from where it left off.

If you are behind a firewall that requires the use of a socks style
gateway, you can get the socks library and compile wget with support
for socks.

Most of the features are configurable, either through command-line
options, or via initialization file .wgetrc.  Wget allows you to
install a global startup file (/usr/local/etc/wgetrc by default) for
site settings.

Wget works under almost all Unix variants in use today and, unlike
many of its historical predecessors, is written entirely in C, thus
requiring no additional software, such as Perl.  The external software
it does work with, such as OpenSSL, is optional.  As Wget uses the GNU
Autoconf, it is easily built on and ported to new Unix-like systems.
The installation procedure is described in the INSTALL file.

As with other GNU software, the latest version of Wget can be found at
the master GNU archive site ftp.gnu.org, and its mirrors.  Wget
resides at <ftp://ftp.gnu.org/pub/gnu/wget/>.

Please report bugs in Wget to <bug-wget@gnu.org>.

See the file `MAILING-LIST' for information about Wget mailing lists.
Wget's home page is at <http://www.gnu.org/software/wget/>.

If you would like to contribute code for Wget, please read
http://wget.addictivecode.org/PatchGuidelines.

Wget was originally written and mainained by Hrvoje Niksic.  Please see
the file AUTHORS for a list of major contributors, and the ChangeLogs
for a detailed listing of all contributions.


Copyright (C) 1995, 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004
2005, 2006, 2007, 2008, 2009 Free Software Foundation, Inc.

This program is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program; if not, write to the Free Software
Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301
USA.

Additional permission under GNU GPL version 3 section 7

If you modify this program, or any covered work, by linking or
combining it with the OpenSSL project's OpenSSL library (or a
modified version of that library), containing parts covered by the
terms of the OpenSSL or SSLeay licenses, the Free Software Foundation
grants you additional permission to convey the resulting work.
Corresponding Source for a non-source form of such a combination
shall include the source code for the parts of OpenSSL used as well
as that of the covered work.