Commit graph

130 commits

Author SHA1 Message Date
e3b10a994b
implement own charset system 2023-07-03 18:25:51 +02:00
f3dbf79927
parse version header 2023-07-03 17:05:17 +02:00
d64b7a0bbc
WIP mime headers 2023-07-03 11:40:02 +02:00
541c92f088
update todo list 2023-06-22 15:11:11 +02:00
5fda64477c
cargo fmt 2023-06-22 15:08:50 +02:00
99c6490eb2
remove obsolete fragments/header.rs 2023-06-22 15:05:52 +02:00
d30d38f124
fix enron test 2023-06-22 14:10:50 +02:00
14bcf119a7
re-enable parser 2023-06-22 12:21:18 +02:00
f7972f8fa5
move some tests from src/ to tests/ 2023-06-22 12:10:25 +02:00
39773d03ba
fix warnings 2023-06-22 11:15:01 +02:00
9d14868491
refactor parser! 2023-06-22 10:48:07 +02:00
34334398d8
Implement new parser in parse.rs 2023-06-20 18:58:07 +02:00
8e67af6c2c
borrow parsers (when it's possible) 2023-06-20 18:48:49 +02:00
2b7b5664c1
final section parser 2023-06-20 18:12:16 +02:00
458d6517d1
write an eager parser 2023-06-20 17:03:18 +02:00
e9c7da850e
refactor of lazy header parser 2023-06-20 15:56:06 +02:00
972ce658eb
new pass: field names 2023-06-20 11:49:00 +02:00
6c2ee3ee67
extract fields 2023-06-19 21:18:23 +02:00
a7efac53c2
wip cow 2023-06-19 19:03:33 +02:00
2bc62edba8
implement the guess charset parser 2023-06-19 18:31:24 +02:00
ddf6311925
segment parser is working 2023-06-19 18:18:15 +02:00
25bc3bb55c
refactor 2023-06-19 17:25:16 +02:00
34b146edaf
first idea on multipass 2023-06-19 17:03:49 +02:00
a8d798692d
add cli for enron 2023-06-19 16:10:02 +02:00
98f42d3efc
validate enron 2023-06-19 16:09:11 +02:00
849c741be9
kean-s ignore list 2023-06-19 15:24:22 +02:00
3415e10b37
fix typo readme 2023-06-19 12:15:45 +02:00
fc2f0943cf
add some implementation indicators 2023-06-19 12:15:05 +02:00
865305a793
filter kaminski+dasovich emails 2023-06-19 11:55:30 +02:00
4fe6c9c8df
wip enron, todo list 2023-06-19 11:22:51 +02:00
cad1ca9978
workaround when multiple at encountered 2023-06-18 22:48:54 +02:00
5083abb835
reorder testing 2023-06-18 22:17:55 +02:00
88fc3c54f2
fix another enron test 2023-06-18 22:11:02 +02:00
950947ee3e
add a charset detector 2023-06-18 22:05:11 +02:00
8f2c944ab8
remove println 2023-06-18 21:18:33 +02:00
92d657d380
add some emails, start debug 2023-06-18 21:17:27 +02:00
a88c4d2dcb
tested datetime 2023-06-18 18:28:50 +02:00
dd6f127fa2
second impl of datetime, with parser 2023-06-18 17:27:01 +02:00
a6dd1e1718
improve readme, wip datetime 2023-06-17 11:43:54 +02:00
1f5d9ebf78
rewrite datetime parsing 2023-06-16 18:16:55 +02:00
1527b41ab4
collect parsing errors 2023-06-16 17:24:51 +02:00
326c903c43
huge test 2023-06-16 15:04:04 +02:00
e56d5bef1d
obsolete domain list 2023-06-16 13:06:58 +02:00
10e2e586b2
add compatibility with obsolete syntax 2023-06-16 12:07:17 +02:00
0ff9466b5f
be more tolerant on field names 2023-06-16 11:41:42 +02:00
9f512ddc0e
better syntax 2023-06-16 11:07:04 +02:00
0e23e49199
rescue logic 2023-06-16 10:50:37 +02:00
c62f803a95
fix bug 2023-06-16 10:19:28 +02:00
486ccf2de0
drop my early implementation of trace 2023-06-16 09:58:07 +02:00
b6c25a4676
working trace test 2023-06-13 22:50:29 +02:00