Darshit Shah
e39be32838
* src/html-url.c(get_urls_html_fm): Add message in verbose mode with no-follow attribute
2019-02-19 23:25:05 +01:00
Darshit Shah
8b975e83d3
Update copyright statements
2019-02-10 11:50:17 +01:00
Tim Rühsen
23b0275feb
Add new HTML parser fuzzer
...
* fuzz/Makefile.am: Add wget_html_fuzzer
* fuzz/wget_html_fuzzer.c: New fuzzer
* fuzz/wget_html_fuzzer.dict: HTML dictionary for fuzzing
* fuzz/wget_html_fuzzer.in: Initial corpora
* src/html-url.c: Add new function get_urls_html_fm()
* src/html-url.h: Add ne function get_urls_html_fm()
* src/wget.h: Fix define for fopen_wgetrc()
2018-04-20 22:33:58 +02:00
Darshit Shah
183fccdaad
Update Copyright years
2018-01-14 11:24:43 +01:00
Tim Rühsen
a2c4849900
Fix crash on 'srcset' inline URIs
...
* src/html-url.c (tag_handle_img): Check append_url() for NULL
return value before dereference.
Crashed reproducable with parsing srcset="data:..." inline data.
Reported-by: Coverity
2016-09-09 11:44:02 +02:00
Tim Rühsen
964f4646da
* src/html-url.c (get_urls_html): Fix memory leak
2016-08-17 23:12:25 +02:00
Tim Rühsen
2bdfc4f521
Fix warnings for --disable-iri configure flag
...
* src/iri.h: Fix #define for parse_charset
* src/html-url.c: Surround some IRI code parts by #ifdef ENABLE_IRI
* src/http.c: Likewise
* src/iri.h: Likewise
* src/recur.c: Likewise
* src/retr.c: Likewise
2016-06-07 12:52:59 +02:00
Maks Orlovich
c28f51aadf
Parse <img srcset> attributes, they have image URLs.
...
* src/convert.h: Add link_noquote_html_p to permit rewriting URLs deep
inside attributes without adding extraneous quoting
* src/convert.c (convert_links): Honor link_noquote_html_p
* src/html_url.c (tag_handle_img): New function. Add srcset parsing.
2016-03-03 09:38:45 +01:00
Darshit Shah
53b22974cb
html-url.c: Fix potential memory leaks
...
Reported by: Coverity Bug 1188050
2015-03-14 16:48:30 +05:30
Giuseppe Scrivano
16f1fb1d1f
maint: update copyright year ranges to include 2015
2015-03-09 16:32:01 +01:00
Tim Ruehsen
4850e9c873
Replaced xfree_null() by xfree() and nullify argument after freeing.
2014-12-01 16:15:37 +01:00
Tim Rühsen
3c51ad7f02
Removed form feeds from sources and NEWS
2014-11-20 16:35:34 +01:00
Tim Rühsen
7b43510fe3
Fixes possible issues with Wget running in a turkish locale
2014-11-20 10:56:21 +01:00
Tim Rühsen
1356e90a14
Trivial fixes for C89 compliancy
2014-11-20 09:56:57 +01:00
Tim Rühsen
bc347cc36f
fixed IRI misbehaviour(s)
2014-10-20 08:53:12 +02:00
Tim Ruehsen
38a7829dcb
Fix compiler warnings
2014-05-12 12:18:50 +02:00
Giuseppe Scrivano
1fd32c995c
iri: Fix parsing of some URLs contained in HTML documents
2013-10-10 23:13:13 +02:00
Ciprian Vieru
6a2d67b583
Add HTML5 media tags to list of known tags and attributes
2013-06-25 23:49:03 +02:00
Rohit Mathulla
5dcb116087
Support shorthand URLs in an input file
2012-11-04 22:03:23 +01:00
Giuseppe Scrivano
06fc1edb54
Remove static modifier for functions used in other modules.
2012-08-28 21:38:12 +02:00
Tim Ruehsen
0aa3c5d33c
Fix some compiler warnings.
2012-05-05 15:24:35 +02:00
Gijs van Tulder
08a147c672
Fix a segfault on an incomplete STYLE tag.
2012-04-01 23:29:16 +02:00
Giuseppe Scrivano
ee6021a3b0
Fix GCC warnings.
2011-04-24 13:03:48 +02:00
Daniel Manrique
0db305126b
Set exit status when invalid host given
2011-04-23 15:31:41 -04:00
Giuseppe Scrivano
2f6aa1d741
mass change: update copyright years.
2011-01-01 13:19:37 +01:00
Manfred Koizar
cea1f0718d
Do not assume external links type to be always "text/html".
2010-10-18 10:55:16 +02:00
Ivanov Anton
62aab82ead
Fixed crash while processing page with invalid `style' attribute.
2010-10-08 13:39:36 +02:00
Giuseppe Scrivano
e095cc064e
Fix a problem when -k is specified and url's are specified in CSS code.
2010-07-30 01:00:26 +02:00
Giuseppe Scrivano
cf3c678c82
Rename two functions.
2010-07-09 12:24:51 +02:00
Giuseppe Scrivano
293008f682
Mass update copyright years.
2010-05-08 21:56:15 +02:00
Micah Cowan
4a08094db8
[mq]: cfg-mk
2009-09-21 20:39:44 -07:00
Micah Cowan
b9e9ad65cc
Ran update-copyright.
2009-09-04 00:13:47 -07:00
Micah Cowan
5febddec70
Fix meta name=robots.
2009-07-05 23:23:17 -07:00
Micah Cowan
18bca2706b
More quoting.
2009-07-02 01:04:11 -07:00
Micah Cowan
889f705d1b
Quote some things that needed quoting.
2009-06-29 21:37:03 -07:00
Saint Xavier
66dd4bda74
IRI requirement: do not percent-encode already percent-encoded values (try1)
2008-09-27 11:13:21 +02:00
Saint Xavier
26a3eea8e2
Removed commented *printf and use quote_n() for quoting several args
2008-08-15 15:15:42 +02:00
Xavier Saint
44a22c78a7
Automated merge.
2008-08-07 10:27:19 +02:00
Micah Cowan
0fae9cb388
stsc: better message for "unsupported schemes".
2008-08-03 22:03:04 -07:00
Saint Xavier
d82f80ecab
Change global variable model for state-object
2008-07-24 00:56:29 +02:00
Saint Xavier
b30a0dd817
Automated merge.
2008-07-20 21:45:09 +02:00
Saint Xavier
1e9ced0170
Get rid of the supplementary bool pointer in url_parse () arguments; UGLY :)
2008-07-20 18:47:52 +02:00
Saint Xavier
9a2ea3938d
Basic IDN/IRI support
2008-07-20 13:10:02 +02:00
Micah Cowan
2e2ac6ad2f
Merge current tip with CSS stuff.
2008-07-11 16:30:51 -07:00
Saint Xavier
13fec85566
Add "content-type" meta tag parsing for retrieving HTML page encoding.
2008-06-19 22:33:38 +02:00
Micah Cowan
dac53e8bdf
Minor tweaks to Henri's patch.
2008-05-31 00:24:53 -07:00
Micah Cowan
48b53471e8
Henri's -Wall patch.
2008-05-30 22:42:36 -07:00
Micah Cowan
caae3b70f4
Merging Ted Mielczarek's CSS changes with tip.
2008-04-22 01:28:15 -07:00
Micah Cowan
a0d0f332d5
Ted Mielczarek's CSS wonder-patch, applied against the source from around the time the patch was written.
2008-04-22 00:15:48 -07:00
Micah Cowan
bc8f473272
Updated copyright year.
2008-01-25 05:04:01 -08:00