PDFDataExtractor: A Tool for Reading Scientific Text and Interpreting Metadata from the Typeset Literature in the Portable Document Format

Zhu, M; Cole, JM

Cole, JM (corresponding author), Univ Cambridge, Dept Phys, Cavendish Lab, Cambridge CB3 0HE, England.; Cole, JM (corresponding author), STFC Rutherford Appleton Lab, ISIS Neutron & Muon Source, Harwell Sci & Innovat Campus, Didcot OX11 0QX, Oxon, England.; Cole, JM (corresponding author), Univ Cambridge, Dept Chem Engn & Biotechnol, Cambridge CB3 0AS, England.

JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022; 62 (7): 1633

Abstract

The layout of portable document format (PDF) files is constant to any screen, and the metadata therein are latent, compared to mark-up languages such ...
