Jiang, Congfeng; Liu, Junming; Ou, Dongyang; Wang, Yumei; … - In: Journal of Database Management (JDM) 29 (2018) 2, pp. 1-22
. On top of PDFBox they built their own pipeline program, namely, PAXAT, to implement their approaches for metadata … extraction. 10177 papers from arXiv, ACM, ACL and other publicly accessed and institution-subscribed sources are tested. The …