Commit Graph

108 Commits

Author SHA1 Message Date
Darshit Shah
e39be32838 * src/html-url.c(get_urls_html_fm): Add message in verbose mode with no-follow attribute 2019-02-19 23:25:05 +01:00
Darshit Shah
8b975e83d3 Update copyright statements 2019-02-10 11:50:17 +01:00
Tim Rühsen
23b0275feb Add new HTML parser fuzzer
* fuzz/Makefile.am: Add wget_html_fuzzer
* fuzz/wget_html_fuzzer.c: New fuzzer
* fuzz/wget_html_fuzzer.dict: HTML dictionary for fuzzing
* fuzz/wget_html_fuzzer.in: Initial corpora
* src/html-url.c: Add new function get_urls_html_fm()
* src/html-url.h: Add ne function get_urls_html_fm()
* src/wget.h: Fix define for fopen_wgetrc()
2018-04-20 22:33:58 +02:00
Darshit Shah
183fccdaad Update Copyright years 2018-01-14 11:24:43 +01:00
Tim Rühsen
a2c4849900 Fix crash on 'srcset' inline URIs
* src/html-url.c (tag_handle_img): Check append_url() for NULL
  return value before dereference.

Crashed reproducable with parsing srcset="data:..." inline data.
Reported-by: Coverity
2016-09-09 11:44:02 +02:00
Tim Rühsen
964f4646da * src/html-url.c (get_urls_html): Fix memory leak 2016-08-17 23:12:25 +02:00
Tim Rühsen
2bdfc4f521 Fix warnings for --disable-iri configure flag
* src/iri.h: Fix #define for parse_charset
* src/html-url.c: Surround some IRI code parts by #ifdef ENABLE_IRI
* src/http.c: Likewise
* src/iri.h: Likewise
* src/recur.c: Likewise
* src/retr.c: Likewise
2016-06-07 12:52:59 +02:00
Maks Orlovich
c28f51aadf Parse <img srcset> attributes, they have image URLs.
* src/convert.h: Add link_noquote_html_p to permit rewriting URLs deep
                 inside attributes without adding extraneous quoting
* src/convert.c (convert_links): Honor link_noquote_html_p
* src/html_url.c (tag_handle_img): New function. Add srcset parsing.
2016-03-03 09:38:45 +01:00
Darshit Shah
53b22974cb html-url.c: Fix potential memory leaks
Reported by: Coverity Bug 1188050
2015-03-14 16:48:30 +05:30
Giuseppe Scrivano
16f1fb1d1f maint: update copyright year ranges to include 2015 2015-03-09 16:32:01 +01:00
Tim Ruehsen
4850e9c873 Replaced xfree_null() by xfree() and nullify argument after freeing. 2014-12-01 16:15:37 +01:00
Tim Rühsen
3c51ad7f02 Removed form feeds from sources and NEWS 2014-11-20 16:35:34 +01:00
Tim Rühsen
7b43510fe3 Fixes possible issues with Wget running in a turkish locale 2014-11-20 10:56:21 +01:00
Tim Rühsen
1356e90a14 Trivial fixes for C89 compliancy 2014-11-20 09:56:57 +01:00
Tim Rühsen
bc347cc36f fixed IRI misbehaviour(s) 2014-10-20 08:53:12 +02:00
Tim Ruehsen
38a7829dcb Fix compiler warnings 2014-05-12 12:18:50 +02:00
Giuseppe Scrivano
1fd32c995c iri: Fix parsing of some URLs contained in HTML documents 2013-10-10 23:13:13 +02:00
Ciprian Vieru
6a2d67b583 Add HTML5 media tags to list of known tags and attributes 2013-06-25 23:49:03 +02:00
Rohit Mathulla
5dcb116087 Support shorthand URLs in an input file 2012-11-04 22:03:23 +01:00
Giuseppe Scrivano
06fc1edb54 Remove static modifier for functions used in other modules. 2012-08-28 21:38:12 +02:00
Tim Ruehsen
0aa3c5d33c Fix some compiler warnings. 2012-05-05 15:24:35 +02:00
Gijs van Tulder
08a147c672 Fix a segfault on an incomplete STYLE tag. 2012-04-01 23:29:16 +02:00
Giuseppe Scrivano
ee6021a3b0 Fix GCC warnings. 2011-04-24 13:03:48 +02:00
Daniel Manrique
0db305126b Set exit status when invalid host given 2011-04-23 15:31:41 -04:00
Giuseppe Scrivano
2f6aa1d741 mass change: update copyright years. 2011-01-01 13:19:37 +01:00
Manfred Koizar
cea1f0718d Do not assume external links type to be always "text/html". 2010-10-18 10:55:16 +02:00
Ivanov Anton
62aab82ead Fixed crash while processing page with invalid `style' attribute. 2010-10-08 13:39:36 +02:00
Giuseppe Scrivano
e095cc064e Fix a problem when -k is specified and url's are specified in CSS code. 2010-07-30 01:00:26 +02:00
Giuseppe Scrivano
cf3c678c82 Rename two functions. 2010-07-09 12:24:51 +02:00
Giuseppe Scrivano
293008f682 Mass update copyright years. 2010-05-08 21:56:15 +02:00
Micah Cowan
4a08094db8 [mq]: cfg-mk 2009-09-21 20:39:44 -07:00
Micah Cowan
b9e9ad65cc Ran update-copyright. 2009-09-04 00:13:47 -07:00
Micah Cowan
5febddec70 Fix meta name=robots. 2009-07-05 23:23:17 -07:00
Micah Cowan
18bca2706b More quoting. 2009-07-02 01:04:11 -07:00
Micah Cowan
889f705d1b Quote some things that needed quoting. 2009-06-29 21:37:03 -07:00
Saint Xavier
66dd4bda74 IRI requirement: do not percent-encode already percent-encoded values (try1) 2008-09-27 11:13:21 +02:00
Saint Xavier
26a3eea8e2 Removed commented *printf and use quote_n() for quoting several args 2008-08-15 15:15:42 +02:00
Xavier Saint
44a22c78a7 Automated merge. 2008-08-07 10:27:19 +02:00
Micah Cowan
0fae9cb388 stsc: better message for "unsupported schemes". 2008-08-03 22:03:04 -07:00
Saint Xavier
d82f80ecab Change global variable model for state-object 2008-07-24 00:56:29 +02:00
Saint Xavier
b30a0dd817 Automated merge. 2008-07-20 21:45:09 +02:00
Saint Xavier
1e9ced0170 Get rid of the supplementary bool pointer in url_parse () arguments; UGLY :) 2008-07-20 18:47:52 +02:00
Saint Xavier
9a2ea3938d Basic IDN/IRI support 2008-07-20 13:10:02 +02:00
Micah Cowan
2e2ac6ad2f Merge current tip with CSS stuff. 2008-07-11 16:30:51 -07:00
Saint Xavier
13fec85566 Add "content-type" meta tag parsing for retrieving HTML page encoding. 2008-06-19 22:33:38 +02:00
Micah Cowan
dac53e8bdf Minor tweaks to Henri's patch. 2008-05-31 00:24:53 -07:00
Micah Cowan
48b53471e8 Henri's -Wall patch. 2008-05-30 22:42:36 -07:00
Micah Cowan
caae3b70f4 Merging Ted Mielczarek's CSS changes with tip. 2008-04-22 01:28:15 -07:00
Micah Cowan
a0d0f332d5 Ted Mielczarek's CSS wonder-patch, applied against the source from around the time the patch was written. 2008-04-22 00:15:48 -07:00
Micah Cowan
bc8f473272 Updated copyright year. 2008-01-25 05:04:01 -08:00