Adam Dickmeiss [Tue, 10 Sep 2013 12:18:54 +0000 (14:18 +0200)]
Careful check of </script> termination
Adam Dickmeiss [Tue, 10 Sep 2013 11:58:43 +0000 (13:58 +0200)]
Try to reduce splitting of text nodes
Adam Dickmeiss [Tue, 10 Sep 2013 11:43:27 +0000 (13:43 +0200)]
No-nest mode for script content in HTML parser MP-486
This is not to be confused with quoted-literal which says that
only things in quotes should be rewritten.
Adam Dickmeiss [Tue, 10 Sep 2013 10:32:39 +0000 (12:32 +0200)]
Remove old header match code
Adam Dickmeiss [Thu, 5 Sep 2013 12:50:07 +0000 (14:50 +0200)]
bounce: don't use strdup for fixed length content
Error occurred in echo mode.
Adam Dickmeiss [Thu, 5 Sep 2013 09:29:41 +0000 (11:29 +0200)]
session_shared: expire backend class when no instances left
Adam Dickmeiss [Thu, 5 Sep 2013 08:42:33 +0000 (10:42 +0200)]
cql_rpn: use path for cql2rpn file MP-481
Adam Dickmeiss [Wed, 4 Sep 2013 13:39:20 +0000 (15:39 +0200)]
virt_db: Fix leak of naked ptr
Adam Dickmeiss [Tue, 3 Sep 2013 13:51:53 +0000 (15:51 +0200)]
Handle SRU queryType (YAZ 5)
Adam Dickmeiss [Thu, 29 Aug 2013 14:36:17 +0000 (16:36 +0200)]
Version 1.3.61
Adam Dickmeiss [Fri, 23 Aug 2013 12:34:03 +0000 (14:34 +0200)]
Ignore libmetaproxy4.dbg
Adam Dickmeiss [Thu, 22 Aug 2013 11:58:28 +0000 (11:58 +0000)]
Add ports.d/routes.d to RPM dir
Adam Dickmeiss [Thu, 22 Aug 2013 10:53:09 +0000 (12:53 +0200)]
Package metaproxy introduces ports.d/routes.d include
This allows custom ports and routes to be added by 3rd party.
Adam Dickmeiss [Wed, 21 Aug 2013 11:58:11 +0000 (13:58 +0200)]
Version 1.3.60-2indexdata
Adam Dickmeiss [Wed, 21 Aug 2013 11:21:36 +0000 (13:21 +0200)]
Version 1.3.60
Adam Dickmeiss [Wed, 21 Aug 2013 11:07:12 +0000 (13:07 +0200)]
Depend on boost.m4 for MP-475
Adam Dickmeiss [Wed, 21 Aug 2013 11:03:15 +0000 (13:03 +0200)]
Update NEWS
Adam Dickmeiss [Mon, 19 Aug 2013 12:09:55 +0000 (14:09 +0200)]
HTML: clean up char checks. Add ptr to HTML parse spec
Adam Dickmeiss [Mon, 19 Aug 2013 11:40:04 +0000 (13:40 +0200)]
HTML parser stops eating tag when <-char is met
Heikki Levanto [Fri, 2 Aug 2013 12:21:54 +0000 (14:21 +0200)]
rewrite: do not even compare content types, if we don't have any
the regex_match caused a segfault if we had no content-type in the
headers.
Heikki Levanto [Tue, 23 Jul 2013 08:56:29 +0000 (10:56 +0200)]
More logging when deciding not to rewrite
Heikki Levanto [Mon, 22 Jul 2013 09:40:49 +0000 (11:40 +0200)]
within header=... matches only the header value
Used to match the whole header line, but we need to use
^$ anchoring for host-relative regexps, and they need to
anchor at the ends of the value. MP-470.
Adam Dickmeiss [Fri, 19 Jul 2013 12:26:04 +0000 (14:26 +0200)]
Add schema for http_rewrite filter
Adam Dickmeiss [Thu, 18 Jul 2013 09:14:26 +0000 (11:14 +0200)]
Ruleset functional MP-469
Adam Dickmeiss [Wed, 17 Jul 2013 13:21:59 +0000 (15:21 +0200)]
Started work on ,-separated rule set
Adam Dickmeiss [Fri, 12 Jul 2013 08:59:20 +0000 (10:59 +0200)]
Version 1.3.59
Adam Dickmeiss [Fri, 12 Jul 2013 08:51:26 +0000 (10:51 +0200)]
Update NEWS
Adam Dickmeiss [Fri, 12 Jul 2013 08:44:24 +0000 (10:44 +0200)]
Merge branch 'master' of ssh://git.indexdata.com/home/git/pub/metaproxy
Heikki Levanto [Thu, 11 Jul 2013 11:37:58 +0000 (13:37 +0200)]
Add echo option to the bounce filter
Adam Dickmeiss [Thu, 11 Jul 2013 11:11:47 +0000 (13:11 +0200)]
session_shared: resolve result-set references MP-292
The reference is substitued with the query for old set if database
match and query type is Type-1. If not the reference will issue a
diagnostic.
Adam Dickmeiss [Thu, 11 Jul 2013 08:30:39 +0000 (10:30 +0200)]
Require YAZ 4.2.62 due to new utilities in use
Adam Dickmeiss [Tue, 9 Jul 2013 13:05:54 +0000 (15:05 +0200)]
http_client: better diagnostics MP-455
Adam Dickmeiss [Tue, 9 Jul 2013 13:05:37 +0000 (15:05 +0200)]
New utility create_HTTP_Response_details
Adam Dickmeiss [Tue, 9 Jul 2013 11:38:00 +0000 (11:38 +0000)]
Add HTMLParserEvent virtual destructor
Adam Dickmeiss [Tue, 9 Jul 2013 11:08:16 +0000 (13:08 +0200)]
session_shared: negotiate init sz parameters MP-465
By default, session_shared will honor the paramters given by client.
An alternative, but not implemented, algorithm would be to only
re-use sessions with same sz parameters. This however, could reduce
re-use of sessions a lot. Hence this, simple algorithm instead. The
negotiation to a maximum sz values can be enabled by using
<init maximum-record-size="sz1" preferred-message-size="sz2"/>
in configuration. This will make session_shared send those values
to backends and use returned init values in negotaition to
client before session_shared.
Adam Dickmeiss [Mon, 8 Jul 2013 13:23:49 +0000 (15:23 +0200)]
More appropriate HTML comment handling
Adam Dickmeiss [Mon, 8 Jul 2013 12:22:35 +0000 (14:22 +0200)]
Use yaz_xml_include_glob MP-459
Adam Dickmeiss [Mon, 8 Jul 2013 09:15:18 +0000 (09:15 +0000)]
Avoid assign empty string for content regex
Adam Dickmeiss [Mon, 8 Jul 2013 08:20:48 +0000 (10:20 +0200)]
header and attr are regex. Case insensitive compare
Adam Dickmeiss [Fri, 5 Jul 2013 13:43:52 +0000 (15:43 +0200)]
Using regex for comparing tags
Adam Dickmeiss [Fri, 5 Jul 2013 12:36:04 +0000 (14:36 +0200)]
Deal with " inside embedded JS
Adam Dickmeiss [Fri, 5 Jul 2013 11:43:06 +0000 (13:43 +0200)]
Honor within type="quoted-literal"
Adam Dickmeiss [Fri, 5 Jul 2013 10:32:14 +0000 (12:32 +0200)]
Refactor content parse to Content::parse
Adam Dickmeiss [Fri, 5 Jul 2013 09:20:25 +0000 (11:20 +0200)]
Simplify test_patterns function
Adam Dickmeiss [Thu, 4 Jul 2013 14:43:27 +0000 (16:43 +0200)]
Don't go beyond end of string for quoted literals
Adam Dickmeiss [Thu, 4 Jul 2013 14:42:48 +0000 (16:42 +0200)]
Test for JS comments
Adam Dickmeiss [Thu, 4 Jul 2013 14:16:54 +0000 (16:16 +0200)]
http_rewrite: quoted literals
Adam Dickmeiss [Wed, 3 Jul 2013 12:12:16 +0000 (14:12 +0200)]
Simplify http rewrite test a bit
Adam Dickmeiss [Wed, 3 Jul 2013 12:03:09 +0000 (14:03 +0200)]
http_rewrite: content areas
One for type=html and one for type=headers.
Adam Dickmeiss [Tue, 2 Jul 2013 13:41:06 +0000 (15:41 +0200)]
HTML parser: elements starts with [A-Za-z]
Adam Dickmeiss [Tue, 2 Jul 2013 12:27:42 +0000 (14:27 +0200)]
Skip start anchor for text sections entirely
Adam Dickmeiss [Tue, 2 Jul 2013 11:24:34 +0000 (13:24 +0200)]
http_rewrite: fix start anchor (^) match
Adam Dickmeiss [Tue, 2 Jul 2013 09:03:08 +0000 (11:03 +0200)]
http_client: disables HTTP redirect by default
It can be configured with max-redirects in configuration.
Adam Dickmeiss [Mon, 1 Jul 2013 14:20:54 +0000 (16:20 +0200)]
filter_http_rewrite: don't crash if "Host" header is missing
Adam Dickmeiss [Mon, 1 Jul 2013 13:46:43 +0000 (15:46 +0200)]
Testing for all Replace rules in use
Adam Dickmeiss [Mon, 1 Jul 2013 13:46:01 +0000 (15:46 +0200)]
Use all Replaces from/to in each buffer
Adam Dickmeiss [Mon, 1 Jul 2013 13:23:23 +0000 (15:23 +0200)]
boost::smatch part of Replace object too
Adam Dickmeiss [Mon, 1 Jul 2013 13:12:44 +0000 (15:12 +0200)]
Refactor: compiled regex part of Replace object
Adam Dickmeiss [Mon, 1 Jul 2013 12:05:04 +0000 (14:05 +0200)]
Remove trailing whitespace
Adam Dickmeiss [Mon, 1 Jul 2013 12:04:31 +0000 (14:04 +0200)]
Option attribute values for HTML parser
Adam Dickmeiss [Thu, 27 Jun 2013 14:40:50 +0000 (16:40 +0200)]
HTMLParser more forgiving with bad attributes
The idea is not to fix, but at least passthru as text.
Adam Dickmeiss [Thu, 27 Jun 2013 12:33:01 +0000 (14:33 +0200)]
HTMLParserEvent, attributes takes quoting sep
as argument, so that quoting (if any) is preserved.
Adam Dickmeiss [Thu, 27 Jun 2013 12:23:02 +0000 (14:23 +0200)]
Event.text for whitespace in <x/ > tags
Adam Dickmeiss [Thu, 27 Jun 2013 11:32:08 +0000 (13:32 +0200)]
Set vars in replacement also if empty
Adam Dickmeiss [Thu, 27 Jun 2013 11:31:46 +0000 (13:31 +0200)]
Reformat a bit
Adam Dickmeiss [Thu, 27 Jun 2013 10:28:25 +0000 (12:28 +0200)]
Stack up attr="#text" things
Adam Dickmeiss [Thu, 27 Jun 2013 09:50:23 +0000 (11:50 +0200)]
Only do CDATA for attr="#text", verbose flag
Adam Dickmeiss [Thu, 27 Jun 2013 09:50:04 +0000 (11:50 +0200)]
More tests for html_parser
Adam Dickmeiss [Thu, 27 Jun 2013 09:49:45 +0000 (11:49 +0200)]
HTMLParser, verbose setting
Adam Dickmeiss [Wed, 26 Jun 2013 14:58:10 +0000 (16:58 +0200)]
Revise HTML parser; keep spelling
Fixes for DTD/PIs handling. 0-copy for tags and attributes (no
limit). No debug output, unless verbose is set for HTMLParser.
Adam Dickmeiss [Wed, 26 Jun 2013 12:33:01 +0000 (14:33 +0200)]
No dupe function (leaks)
Adam Dickmeiss [Wed, 26 Jun 2013 12:29:58 +0000 (14:29 +0200)]
Reformat
Adam Dickmeiss [Wed, 26 Jun 2013 12:28:38 +0000 (14:28 +0200)]
Reformat of HTMLparser source
Adam Dickmeiss [Wed, 26 Jun 2013 12:19:55 +0000 (14:19 +0200)]
http_rewrite using HTML parser
Adam Dickmeiss [Tue, 25 Jun 2013 14:47:02 +0000 (16:47 +0200)]
Deciding rules for within
Not really tested yet.
Adam Dickmeiss [Tue, 25 Jun 2013 14:46:53 +0000 (16:46 +0200)]
Remove unused code, fix printing
Adam Dickmeiss [Tue, 25 Jun 2013 13:21:15 +0000 (15:21 +0200)]
Refactor a bit
Adam Dickmeiss [Fri, 21 Jun 2013 12:51:07 +0000 (14:51 +0200)]
http_rewrite: HTMLParser boilerplate
Adam Dickmeiss [Fri, 21 Jun 2013 12:30:09 +0000 (14:30 +0200)]
Remove redundant include
Adam Dickmeiss [Fri, 21 Jun 2013 12:28:56 +0000 (14:28 +0200)]
http_rewrite: rename Section -> Phase
Adam Dickmeiss [Thu, 20 Jun 2013 13:12:13 +0000 (15:12 +0200)]
http_rewrite: parsing of new configuration
but at this point, still no HTML parsing and only first
"within" rule is being executed everywhere.
Adam Dickmeiss [Wed, 19 Jun 2013 13:06:44 +0000 (15:06 +0200)]
Add http rewrite objects for windows
Adam Dickmeiss [Wed, 19 Jun 2013 10:47:12 +0000 (12:47 +0200)]
http_file: raw mode functional
and schema updated accordingly
Adam Dickmeiss [Wed, 19 Jun 2013 09:44:09 +0000 (11:44 +0200)]
Merge branch 'master' into http_file_raw
Adam Dickmeiss [Wed, 19 Jun 2013 09:42:50 +0000 (11:42 +0200)]
sort list of filters
Adam Dickmeiss [Wed, 19 Jun 2013 09:26:21 +0000 (11:26 +0200)]
http_rewrite: Avoid ?<group> regex'es.
They don't work in Boost 1.33 (rather old).
Adam Dickmeiss [Wed, 19 Jun 2013 08:59:38 +0000 (08:59 +0000)]
Reformat: remove trailing whitespace
Adam Dickmeiss [Wed, 19 Jun 2013 08:58:50 +0000 (08:58 +0000)]
string offsets of type size_t
Adam Dickmeiss [Wed, 19 Jun 2013 08:47:33 +0000 (08:47 +0000)]
Fix warning
Adam Dickmeiss [Wed, 19 Jun 2013 08:26:59 +0000 (10:26 +0200)]
Merge branch 'master' into rewrite-filter
Adam Dickmeiss [Mon, 17 Jun 2013 12:49:22 +0000 (14:49 +0200)]
filter sort relays RecordComposition
including elementSet in use for search.
Adam Dickmeiss [Mon, 17 Jun 2013 12:48:51 +0000 (14:48 +0200)]
Add util piggyback_to_RecordComposition
Adam Dickmeiss [Fri, 14 Jun 2013 08:42:02 +0000 (10:42 +0200)]
Remove useless comment in retrievalinfo schema
Heikki Levanto [Thu, 13 Jun 2013 10:57:45 +0000 (12:57 +0200)]
Fix src/Makefile.am after merge
Heikki Levanto [Thu, 13 Jun 2013 10:16:03 +0000 (12:16 +0200)]
Merge remote-tracking branch 'origin/master' into rewrite-filter
Adam Dickmeiss [Thu, 13 Jun 2013 09:57:17 +0000 (11:57 +0200)]
Build-in filters part of libmetaproxy so again
And registration takes place in FactoryStatic . Not really
elegant.
Adam Dickmeiss [Wed, 12 Jun 2013 19:49:23 +0000 (21:49 +0200)]
Version 1.3.58
Adam Dickmeiss [Wed, 12 Jun 2013 19:41:57 +0000 (21:41 +0200)]
Fix load of filters cql_rpn, sru_z3950
Adam Dickmeiss [Wed, 12 Jun 2013 13:44:10 +0000 (15:44 +0200)]
Version 1.3.57
Adam Dickmeiss [Wed, 12 Jun 2013 13:41:51 +0000 (15:41 +0200)]
No longer build for Ubuntu natty