Actas de congresos
Automatic Interpretation Biodiversity Spreadsheets Based On Recognition Of Construction Patterns
Iceis 2014 - Proceedings Of The 16th International Conference On Enterprise Information Systems. Scitepress, v. 3, n. , p. 57 - 68, 2014.
Spreadsheets are widely adopted as "popular databases", where authors shape their solutions interactively. Although spreadsheets have characteristics that facilitate their adaptation by the author, they are not designed to integrate data across independent spreadsheets. In biology, we observed a significant amount of biodiversity data in spreadsheets treated as isolated entities with different tabular organizations, but with high potential for data articulation. In order to promote interoperability among these spreadsheets, we propose in this paper a technique based on pattern recognition of spreadsheets belonging to the biodiversity domain. It can be exploited to identify the spreadsheet in a higher level of abstraction - e.g., it is possible to identify the nature a spreadsheet as catalog or collection of specimen - improving the interoperability process. The paper details evidences of construction patterns of spreadsheets as well as proposes a semantic representation to them.35768Control and Communication (INSTICC),Institute for Systems and Technologies of InformationAbraham, R., Erwig, M., Inferring templates from spreadsheets (2006) Proceeding of the 28th International Conference on Software Engineering-ICSE '06, 15, p. 182Connor, M.J.O., Halaschek-Wiener, C., Musen, M.A., Mapping master: A flexible approach for mapping spreadsheets to OWL (2010) Proceedings of the International Semantic Web Conference, pp. 194-208Doush, I.A., Pontelli, E., Detecting and recognizing tables in spreadsheets (2010) Proceedings of the 8th IAPR International Workshop on Document Analysis Systems-DAS '10, pp. 471-478Han, L., RDF123: From spreadsheets to RDF (2008) The Semantic Web, pp. 451-466. , SpringerHaslhofer, B., Klas, W., A survey of techniques for achieving metadata interoperability (2010) ACM Computing Surveys, 42 (2), pp. 1-37Seiie Ko Eun-Jung, J., Woo, W., Unified user-centric context: Who, where, when, what, how and why? (2005) Proceedings of the International Workshop on Personalized Context Modeling and Management for UbiComp Applications, pp. 26-34Jannach, D., Shchekotykhin, K., Friedrich, G., Automated ontology instantiation from tabular web sources - The all right system? (2009) Web Semantics: Science, Services and Agents on the World Wide Web, 7 (3), pp. 136-153Langegger, A., Wolfram, W., XLWrap-querying and integrating arbitrary spreadsheets with SPARQL (2009) The Semantic Web, pp. 359-374Mulwad, V., Using linked data to interpret tables (2010) Proceedings of the International Workshop on Consuming Linked Data, pp. 1-12Ouksel, A.M., Sheth, A., (1999) Semantic Interoperability in Global Information Systems A Brief Introduction to the Research Area and the Special Section, 28 (1), pp. 5-12Pérez, J., Arenas, M., Gutierrez, C., Semantics and complexity of SPARQL (2009) ACM Transactions on Database Systems, 34 (3), pp. 1-45Ponder, W.F., (2010) Evaluation of Museum Collection Data for Use in Biodiversity Assessment, 15 (3), pp. 648-657De Saussure, F., (2011) Course in general linguistics, , R. Harris, edSyed, Z., (2010) Exploiting A Web of Semantic Data for Interpreting Tables, pp. 26-27. , (AprilTolk, A., (2006) What Comes after the Semantic Web-PADS Implications for the Dynamic Web, pp. 55-62Venetis, P., Recovering semantics of tables on the web (2011) Proceedings of the VLDB Endowment, 4, pp. 528-538Yang, S., Bhowmick, S.S., Madria, S., Bio2X: A rule-based approach for semi-automatic transformation of semi-structured biological data to XML (2005) Data & Knowledge Engineering, 52 (2), pp. 249-271Zhao, C., Zhao, L., Wang, H., A spreadsheet system based on data semantic object (2010) 2010 2nd IEEE International Conference on Information Management and Engineering, pp. 407-411