[Most-ai-contest] The performance of the last version FGC QA system
范正忠
jjfan at iis.sinica.edu.tw
Fri Apr 3 17:25:53 CST 2020
Dear all,
I took a quick look at the Date-Duration questions in the 'test' and 'dev' data sets; the results are listed below.
You can see that:
1. Some answers are ranked Top-2 or Top-3.
2. Some need arithmetic operations.
3. Some need a format conversion (西元 -> 民國, i.e. Western calendar year -> Minguo year).
4. D267Q07 -> I think our answer should count as correct, since only "戰國末年" (late Warring States period) occurs in the passage. (Gold answer: 戰國時期, Warring States period)
5. D303Q05 -> "初冬" (early winter) should be acceptable, since only "初冬" occurs in the passage. (Gold answer: 冬, winter)
jjfan
Dev: D061Q04 D073Q05 D097Q06 D103Q04 D241Q04 D247Q08 D307Q03
Top-2; Top-2; Top-2; Top1, needs subtraction ("1506年至1626年"); Dispatcher error: Single-Span -> Date-Duration
Test: D033Q11 D069Q04 D069Q05 D069Q06 D069Q07 D087Q02 D105Q02 D105Q04 D117Q01 D117Q02 D243Q07 D267Q07 D303Q05
Top-2; Top1, needs subtraction ("1981年至纽约大学就读电影制作研究所,于1984年"); Top1 - Top2; Top1, prepend the year 2019; Top1 "1995年", but the output formatter did not convert it to a Minguo year; Top-2; Single-Span; Top1 Top2, needs subtraction ("1506年4月18日,完成于1626年11月18日"); Top1 "戰國末年"; Top3 "初冬"
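The two post-processing steps noted above, subtracting the years found in a span and converting a Western-calendar year to a Minguo year, could be sketched roughly as follows. This is only an illustration and not the code of any date_duration_module version: the helper names `years_in_span`, `year_difference` and `to_minguo` are hypothetical, the subtraction looks only at year numbers (months and days are ignored), and the conversion assumes the usual offset of 1911 between the two calendars.

```python
import re
from typing import List, Optional

MINGUO_OFFSET = 1911  # Minguo year 1 corresponds to 1912 CE


def years_in_span(span: str) -> List[int]:
    """Return every 3- or 4-digit number that is directly followed by 年 (year)."""
    return [int(y) for y in re.findall(r"(\d{3,4})年", span)]


def year_difference(span: str) -> Optional[int]:
    """Subtract the first year in the span from the last one.

    Returns None when the span holds fewer than two years.
    """
    years = years_in_span(span)
    if len(years) < 2:
        return None
    return years[-1] - years[0]


def to_minguo(ce_year: int) -> str:
    """Format a Western-calendar year as a Minguo year string."""
    if ce_year <= MINGUO_OFFSET:
        raise ValueError("year precedes the Minguo era")
    return f"民國{ce_year - MINGUO_OFFSET}年"


print(year_difference("1506年至1626年"))  # 120
print(year_difference("1981年至纽约大学就读电影制作研究所,于1984年"))  # 3
print(to_minguo(1995))  # 民國84年
```

Whether a year-only difference is the expected answer for a given question still has to be checked against the gold answer; spans with full dates may call for month-level arithmetic instead.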
From: "范正忠" <jjfan at iis.sinica.edu.tw>
To: "Simonc" <simonc at iis.sinica.edu.tw>
Cc: "Most-ai Contest" <Most-ai-contest at iis.sinica.edu.tw>
Sent: Friday, April 3, 2020 4:45:39 PM
Subject: [Most-ai-contest] The performance of the last version FGC QA system
Dear all,
Enclosed please find the current performance of the FGC QA system.
April 2, ENSEMBLEModule: 12 models (new)
                 train   dev     test
  Single-Span    0.890   0.801   0.750
  Kuo            0.899   0.699   0.741
  Multi-Span     0.493   0.476   0.350
  Date-Duration  0.589   0.720   0.581
  Train: 0.786   Dev: 0.709   Test: 0.658

April 3, ENSEMBLEModule: 12 models (new), MSPE_v18_branchy27, date_duration_module_4
                 train   dev     test
  Single-Span    0.890   0.801   0.750
  Kuo            0.899   0.699   0.741
  Multi-Span     0.548   0.667   0.450
  Date-Duration  0.589   0.720   0.581
  Train: 0.791   Dev: 0.725   Test: 0.668
郭家銍, could you please help check Date-Duration ver. 4?
Best,
jjfan
From: "范正忠" <jjfan at iis.sinica.edu.tw>
To: "Simonc" <simonc at iis.sinica.edu.tw>
Cc: "Most-ai Contest" <Most-ai-contest at iis.sinica.edu.tw>
Sent: Thursday, April 2, 2020 3:56:36 PM
Subject: Re: [Most-ai-contest] Updated Performance Data with Gold Answer Inclusion Rate
Dear all,
Enclosed please find the performance of the latest code:
ENSEMBLEModule: 12 models (new), single_span_multi_hop_v2_1
Multi-Spans: MSPE_v18_branchy20
date_duration_module_3
arithmetic_module_3
kinship_module6
Overall:
train -> total: 882, correct: 668, accuracy: 0.757
dev -> total: 247, correct: 161, accuracy: 0.652
test -> total: 193, correct: 126, accuracy: 0.653
Single-Span:
train -> portion:0.619, count:546, errors: 86, accuracy: 0.842
ensemble only -> portion:0.619, count:546, errors: 37, accuracy: 0.932
multi-hop only -> portion:0.619, count:546, errors: 76, accuracy: 0.861
dev -> portion:0.632, count:156, errors: 45, accuracy: 0.712
ensemble only -> portion:0.632, count:156, errors: 37, accuracy: 0.763
multi-hop only -> portion:0.632, count:156, errors: 43, accuracy: 0.724
test -> portion:0.580, count:112, errors: 29, accuracy: 0.741
ensemble only -> portion:0.580, count:112, errors: 23, accuracy: 0.795
multi-hop only -> portion:0.580, count:112, errors: 26, accuracy: 0.768
Multi-Spans:
train -> portion:0.083, count:73, errors: 37, accuracy: 0.493
dev -> portion:0.085, count:21, errors: 11, accuracy: 0.476
test -> portion:0.104, count:20, errors: 13, accuracy: 0.350
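For reference, the portion and accuracy figures above follow directly from the counts: portion = count / total, and accuracy = (count - errors) / count. The short sketch below reproduces the Single-Span and Multi-Spans lines from those counts; the function and variable names are illustrative and are not taken from the actual evaluation script.

```python
from typing import Dict, List, Tuple

# Totals per split, from the "Overall" lines.
TOTALS: Dict[str, int] = {"train": 882, "dev": 247, "test": 193}

# (count, errors) per split, from the per-type lines.
SINGLE_SPAN: Dict[str, Tuple[int, int]] = {
    "train": (546, 86), "dev": (156, 45), "test": (112, 29),
}
MULTI_SPANS: Dict[str, Tuple[int, int]] = {
    "train": (73, 37), "dev": (21, 11), "test": (20, 13),
}


def accuracy(count: int, errors: int) -> float:
    """Fraction of questions of this type answered correctly."""
    return (count - errors) / count


def portion(count: int, total: int) -> float:
    """Share of the whole split taken up by this question type."""
    return count / total


def report(stats: Dict[str, Tuple[int, int]]) -> List[str]:
    """Format one line per split in the same layout as the listing above."""
    lines = []
    for split, (count, errors) in stats.items():
        lines.append(
            f"{split} -> portion:{portion(count, TOTALS[split]):.3f}, "
            f"count:{count}, errors: {errors}, "
            f"accuracy: {accuracy(count, errors):.3f}"
        )
    return lines


for line in report(SINGLE_SPAN) + report(MULTI_SPANS):
    print(line)
```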
jjfan
From: "Simonc" <simonc at iis.sinica.edu.tw>
To: "Most-ai Contest" <Most-ai-contest at iis.sinica.edu.tw>
Sent: Tuesday, March 31, 2020 6:00:52 PM
Subject: [Most-ai-contest] Updated Performance Data with Gold Answer Inclusion Rate
Dear all,
The attached file contains the performance data from our results on 3/27.
This time, the gold answer inclusion rate is also included. (That is, counting the cases where the correct answer is included in the answer candidates.)
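As a rough illustration of how such an inclusion rate can be computed, here is a minimal sketch. The data layout (pairs of a gold answer and a ranked candidate list), the exact-string match, and the name `inclusion_rate` are all assumptions made for the example, and the data is made-up toy data, not taken from the attached file.

```python
from typing import List, Optional, Sequence, Tuple

Example = Tuple[str, List[str]]  # (gold answer, ranked answer candidates)


def inclusion_rate(examples: Sequence[Example], k: Optional[int] = None) -> float:
    """Fraction of examples whose gold answer appears among the candidates.

    With k given, only the k highest-ranked candidates are considered;
    with k=None the whole candidate list is used.
    """
    if not examples:
        raise ValueError("no examples given")
    hits = sum(1 for gold, candidates in examples if gold in candidates[:k])
    return hits / len(examples)


# Toy data: placeholder strings stand in for real answers.
toy = [
    ("a", ["a", "b", "c"]),  # gold ranked first
    ("b", ["c", "b", "a"]),  # gold ranked second
    ("c", ["a", "b"]),       # gold missing from the candidates
    ("d", ["x", "y", "d"]),  # gold ranked third
]

print(inclusion_rate(toy))       # 0.75
print(inclusion_rate(toy, k=1))  # 0.25
print(inclusion_rate(toy, k=2))  # 0.5
```

With k=1 the value coincides with exact-match accuracy of the top answer, so the gap between that and the full inclusion rate shows how much could be gained by better ranking alone.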
Regards,
張光瑜
_______________________________________________
Most-ai-contest mailing list
Most-ai-contest at iis.sinica.edu.tw
https://www.iis.sinica.edu.tw/mailman/listinfo/most-ai-contest