Bibtex
Cite as text
@Select Types{,
Journal = "Band-1",
Title= "Using CNNs to Detect Graphical Representations of Structural Equation Models in IS Papers",
Author= "Tobias Genz, Burkhardt Funk Leuphana",
Doi= "https://doi.org/10.30844/wi_2020_a8-genz",
Abstract= "Literature reviews are an essential but time-consuming part of every research endeavor and play an important role in the quality of the research findings. Traditional tools and literature databases only make use of the textual information and do not consider graphical representations like figures of structural equation models (SEMs). These models are often used in empirical studies to visualize theoretical models and key results. We design and implement an application for image recognition to simplify the search for relevant papers, by automatically recognizing SEM figures in scientific papers stored as PDF files. To classify whether a page in a paper contains an SEM figure we make use of convolutional neural networks and achieve an F1 score of 98,7% together with a recall of 100% for the SEM class. We further describe how we intend to automatically extract information from these SEM figures.
",
Keywords= "Structural equation models, deep neural networks, information extraction, literature review",
}
Tobias Genz, Burkhardt Funk Leuphana: Using CNNs to Detect Graphical Representations of Structural Equation Models in IS Papers. Online: https://doi.org/10.30844/wi_2020_a8-genz (Abgerufen 24.11.24)
Open Access
Literature reviews are an essential but time-consuming part of every research endeavor and play an important role in the quality of the research findings. Traditional tools and literature databases only make use of the textual information and do not consider graphical representations like figures of structural equation models (SEMs). These models are often used in empirical studies to visualize theoretical models and key results. We design and implement an application for image recognition to simplify the search for relevant papers, by automatically recognizing SEM figures in scientific papers stored as PDF files. To classify whether a page in a paper contains an SEM figure we make use of convolutional neural networks and achieve an F1 score of 98,7% together with a recall of 100% for the SEM class. We further describe how we intend to automatically extract information from these SEM figures.
Structural equation models, deep neural networks, information extraction, literature review
1. Bong, C.H., Larsen, K.R., James, M.: A large scale knowledge integration leading to human decision making. In: 2012 IEEE Symposium on Computers Informatics (ISCI), pp. 22–27 (2012)
2. Urbach, N., Ahlemann, F.: Structural equation modeling in information systems research using partial least squares. Journal of Information technology theory and application 11, 5–40 (2010)
3. Larsen, K.R., Bong, C.H.: A Tool for Addressing Construct Identity in Literature Reviews and Meta-Analyses. MIS Quarterly 40, 529–551 (2016)
4. Gregor, S., Hevner, A.R.: Positioning and Presenting Design Science Research for Maximum Impact. MIS Q 37, 337–355 (2013)
5. Mueller, R., Abdullaev, S.: DeepCause: Hypothesis Extraction from Information Systems Papers with Deep Learning for Theory Ontology Learning. Proceedings of the 52nd Hawaii International Conference on System Sciences (2019)
6. Sturm, B., Sunyaev, A.: You Can’t Make Bricks Without Straw: Designing Systematic Literature Search Systems. In: ICIS 2017: Proceedings of the International Conference on Information Systems (2017)
7. Mueller, R.M., Huettemann, S.: Extracting Causal Claims from Information Systems Papers with Natural Language Processing for Theory Ontology Learning. Proceedings of the 51st Hawaii International Conference on System Sciences (2018)
8. Bosco, F., Steel, P., Oswald, F., Uggerslev, K., Field, J.: Cloud-based Metaanalysis to Bridge Science and Practice: Welcome to metaBUS. PAD 1 (2015)
9. Siegel, N., Lourie, N., Power, R., Ammar, W.: Extracting Scientific Figures with Distantly Supervised Neural Networks. Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries – JCDL 18, 223–232 (2018)
10. Clark, C., Divvala, S.: PDFFigures 2.0. Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries – JCDL 16, 143–152 (2016)
11. Singh, M., Barua, B., Palod, P., Garg, M., Satapathy, S., Bushi, S., Ayush, K., Sai Rohith, K., Gamidi, T., Goyal, P., et al.: OCR++: A Robust Framework For Information Extraction from Scholarly Articles. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, 3390–3400 (2016)
12. Councill, I.G., Giles, C.L., Kan, M.-Y.: ParsCit: an Open-source CRF Reference String Parsing Package. In: LREC (2008)
13. Ray Choudhury, S., Mitra, P., Giles, C.L.: Automatic Extraction of Figures from Scholarly Documents. Proceedings of the 2015 ACM Symposium on Document Engineering – DocEng 15, 47–50 (2015)
14. Stahl, C.G., Young, S.R., Herrmannova, D., Patton, R.M., Wells, J.C.: DeepPDF: A Deep Learning Approach to Extracting Text from PDFs (2018)
15. Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., et al.: ImageNet Large Scale Visual Recognition Challenge. Int. J. Comput. Vision 115, 211–252 (2015)
16. Chollet, F. and others: Keras (2015)
17. Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., et al.: TensorFlow: A System for Large-scale Machine Learning. In: Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation, pp. 265–283. USENIX Association, Berkeley, CA, USA (2016)
18. Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto and Hartwig Adam: MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications (2017)
19. He, K., Zhang, X., Ren, S., Sun, J.: Deep Residual Learning for Image Recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016)
20. Chollet, F.: Xception: Deep Learning with Depthwise Separable Convolutions. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1800–1807 (2017)
21. Wong, S.C., Gatt, A., Stamatescu, V., McDonnell, M.D.: Understanding Data Augmentation for Classification: When to Warp? 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 1–6 (2016)
22. Zhao, Z.-Q., Zheng, P., Xu, S.-T., Wu, X.: Object Detection With Deep Learning: A Review. IEEE transactions on neural networks and learning systems (2019)