htmlparser-user Mailing List for HTML Parser

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

If I recall correctly, implementing this feature would require deferring 
not only the connect but also the determination of the character set 
(from the header returned by the connect) and creation of the reader 
(because it needs the character set, and an input stream) until 
elements() is called.  elements() would need to check for a null reader 
and do the work. Then getReader() and getEncoding() would also have to 
handle a null reader or null character_set too. Are there other subtleties?

Maybe tricky, but probably do-able.  I think all the constructors have 
test cases.

But then, all that's really being saved is the user coding:

    Parser parser = new Parser ("http://yadda");
    URL url = parser.getConnection ();
    ...process the url as appropriate
    ... parser.elements ()

instead of:

    URL url = new URL ("http://yadda");
    url.openConnection ();
    ...process the url as appropriate
    Parser parser = new Parser (url);
    ... parser.elements ()

So it's probably not really worth the convoluted coding, unless I'm 
missing something in the use-case.

Derrick

htm...@li... wrote:

>
>Also, on another note, if I try to initialize the
>parser directly, I am unable to work with the
>URLConnection.  For example:
>
>    HttpURLConnection urlConn = null;
>    HTMLParser parser = new
>HTMLParser("http://somedomain/somepath");
>    urlConn =
>(HttpURLConnection)parser.getConnection();
>    urlConn.setDoInput(true);
>    // ...
>
>This code throws an exception because the HTTP request
>has already been made.
>
>Exception in thread "main"
>java.lang.IllegalAccessError: Already connected
>        at
>java.net.URLConnection.setDoInput(URLConnection.java:677)
>
>--- Bob Lewis <bob...@ya...> wrote:
>  
>
<snip>

>
>--__--__--
>
>Message: 3
>From: "Somik Raha" <so...@ya...>
>To: <htm...@li...>
>Subject: Re: [Htmlparser-user] Malformed Input Exception
>Date: Tue, 25 Feb 2003 22:46:16 -0800
>Reply-To: htm...@li...
>
>That sounds like a good feature request. Derrick ->what do you think ?
>
>Regards,
>Somik
>
>  
>
>
>  
>

2001	Jan	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov (1)	Dec
2002	Jan (7)	Feb	Mar (9)	Apr (50)	May (20)	Jun (47)	Jul (37)	Aug (32)	Sep (30)	Oct (11)	Nov (37)	Dec (47)
2003	Jan (31)	Feb (70)	Mar (67)	Apr (34)	May (66)	Jun (25)	Jul (48)	Aug (43)	Sep (58)	Oct (25)	Nov (10)	Dec (25)
2004	Jan (38)	Feb (17)	Mar (24)	Apr (25)	May (11)	Jun (6)	Jul (24)	Aug (42)	Sep (13)	Oct (17)	Nov (13)	Dec (44)
2005	Jan (10)	Feb (16)	Mar (16)	Apr (23)	May (6)	Jun (19)	Jul (39)	Aug (15)	Sep (40)	Oct (49)	Nov (29)	Dec (41)
2006	Jan (28)	Feb (24)	Mar (52)	Apr (41)	May (31)	Jun (34)	Jul (22)	Aug (12)	Sep (11)	Oct (11)	Nov (11)	Dec (4)
2007	Jan (39)	Feb (13)	Mar (16)	Apr (24)	May (13)	Jun (12)	Jul (21)	Aug (61)	Sep (31)	Oct (13)	Nov (32)	Dec (15)
2008	Jan (7)	Feb (8)	Mar (14)	Apr (12)	May (23)	Jun (20)	Jul (9)	Aug (6)	Sep (2)	Oct (7)	Nov (3)	Dec (2)
2009	Jan (5)	Feb (8)	Mar (10)	Apr (22)	May (85)	Jun (82)	Jul (45)	Aug (28)	Sep (26)	Oct (50)	Nov (8)	Dec (16)
2010	Jan (3)	Feb (11)	Mar (39)	Apr (56)	May (80)	Jun (64)	Jul (49)	Aug (48)	Sep (16)	Oct (3)	Nov (5)	Dec (5)
2011	Jan (13)	Feb	Mar (1)	Apr (7)	May (7)	Jun (7)	Jul (7)	Aug (8)	Sep	Oct (6)	Nov (2)	Dec
2012	Jan (5)	Feb	Mar (3)	Apr (3)	May (4)	Jun (8)	Jul (1)	Aug (5)	Sep (10)	Oct (3)	Nov (2)	Dec (4)
2013	Jan (4)	Feb (2)	Mar (7)	Apr (7)	May (6)	Jun (7)	Jul (3)	Aug	Sep (1)	Oct	Nov	Dec
2014	Jan	Feb (2)	Mar (1)	Apr	May (3)	Jun (1)	Jul	Aug	Sep (1)	Oct (4)	Nov (2)	Dec (4)
2015	Jan (4)	Feb (2)	Mar (8)	Apr (7)	May (6)	Jun (7)	Jul (3)	Aug (1)	Sep (1)	Oct (4)	Nov (3)	Dec (4)
2016	Jan (4)	Feb (6)	Mar (9)	Apr (9)	May (6)	Jun (1)	Jul (1)	Aug	Sep	Oct (1)	Nov (1)	Dec (1)
2017	Jan	Feb (1)	Mar (3)	Apr (1)	May	Jun (1)	Jul (2)	Aug (3)	Sep (6)	Oct (3)	Nov (2)	Dec (5)
2018	Jan (3)	Feb (13)	Mar (28)	Apr (5)	May (4)	Jun (2)	Jul (2)	Aug (8)	Sep (2)	Oct (1)	Nov (5)	Dec (1)
2019	Jan (8)	Feb (1)	Mar	Apr (1)	May (4)	Jun	Jul (1)	Aug	Sep	Oct	Nov (2)	Dec (2)
2020	Jan	Feb	Mar (1)	Apr (1)	May (1)	Jun (2)	Jul (1)	Aug (1)	Sep (1)	Oct	Nov (1)	Dec (1)
2021	Jan (3)	Feb (2)	Mar (1)	Apr (1)	May (2)	Jun (1)	Jul (2)	Aug (1)	Sep	Oct	Nov	Dec
2022	Jan	Feb	Mar	Apr (1)	May (1)	Jun (1)	Jul	Aug (1)	Sep	Oct	Nov	Dec
2023	Jan (2)	Feb	Mar	Apr	May	Jun	Jul	Aug (1)	Sep	Oct	Nov	Dec
2024	Jan (2)	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2025	Jan	Feb	Mar	Apr	May	Jun (1)	Jul	Aug	Sep	Oct (1)	Nov	Dec

S	M	T	W	T	F	S
						1
2	3 (1)	4 (4)	5 (7)	6 (7)	7 (8)	8 (2)
9	10 (1)	11 (3)	12 (2)	13 (1)	14 (4)	15
16 (3)	17 (2)	18 (3)	19 (6)	20	21	22 (1)
23 (2)	24 (4)	25 (2)	26 (5)	27 (2)	28

htmlparser-user Mailing List for HTML Parser

htmlparser-user — The user mailing list for users of the htmlparser library