我总是在java程序中使用univocity解析器来比较csv文件。它工作得很好,速度快得多。
但问题是,这次我试图解析两个具有复杂值的不同大容量csv文件,并在新的csv文件中打印差异,
查看作者的一个示例,在将文件1读入列表并转换为映射后,我尝试使用processFile,但在解析时仍然出现错误。
下面是我的示例输入和预期输出文件。
输入-文件1
"h1","h2","h3","h4","h5"
"00000","US","9503.00.0089","USA","9503.0089"
"","EU","9503.00.7000","EUROPEAN UNION","9503.00.7000"
"#1200","US","5601.22.0010","USA","5601.22.0010"
"0180691","US","9503.00.0073","USA","9503.00.0073"
âDRTY01â,âCAâ,â9603.01.0088â,âCANâ,â9603.01.0088â
输入-文件2
"h1","h2","h3","h6","h7","h8","h9","h10",h11
"018890","US","","2015","101","1","1","All",ââ
"00000","US","9503.00.0090","1986","101","1","1","All","9503.00.0090"
"0180691","US","9503.00.0073","2019","101","1","1","All","9503.00.0073â
âDRTY01â,âCAâ,â9603.01.0087â,â2002â,â102â,â1â,â2â,âCAâ, â9603.01.0087â
在file1和file2中选择h1,h2公共值,然后比较file1的h3和file2的h3,如果两个文件h3不相等,那么我想打印-h1-h1,h4,h10,h5,h11,h6,h7,h7。到文件3
输出-文件3
âh1â,âh4â,â h10â,âh5â, âh11â,âh6â,âh7â,âh8â,âh9â
"00000","USAâ,âAllâ,â9503.00.0089â,â9503.00.0090â, "1986","101","1","1"
"DRTY01â,âCANâ,âCAâ,â9603.01.0088â,â9603.01.0087â,â2002â,â102â,â1â,â2â