Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Semantify CEUR-WS Proceedings: Towards the Automatic Generation of Highly Descriptive Scholarly Publishing Linked Datasets

  • Conference paper
  • First Online:
Semantic Web Evaluation Challenge (SemWebEval 2014)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 475))

Included in the following conference series:

Abstract

Rich and fine-grained semantic information describing varied aspects of scientific productions is essential to support their diffusion as well as to properly assess the quality of their output. To foster this trend, in the context of the ESWC2014 Semantic Publishing Challenge, we present a system that automatically generates rich RDF datasets from CEUR-WS workshop proceedings. Proceedings are analyzed through a sequence of processing phases. SVM classifiers complemented by heuristics are used to annotate missing CEUR-WS markups. Annotations are then linked to external datasets like DBpedia and Bibsonomy. Finally, the data is modeled and published as an RDF graph. Our system is provided as an on-line Web service to support on-the-fly RDF generation. In this paper we describe the system and present its evaluation following the procedure set by the organizers of the challenge.

The work described in this paper has been funded by the European Project Dr Inventor (FP7-ICT-2013.8.1 - Grant no: 611383) and the project SKATER-UPF-TALN (TIN2012-38584-C06-03) funded by Ministerio de Economía y Competitividad, Secretaría de Estado de Investigación, Desarrollo e Innovación, Spain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://dblp.l3s.de/d2r/

  2. 2.

    http://acm.rkbexplorer.com/

  3. 3.

    http://ieee.rkbexplorer.com/

  4. 4.

    A semantic markup approach that conveys metadata and other attributes in Web pages by existing HTML/XHTML tags.

  5. 5.

    A semantic markup useful to embed RDF triples within XHTML documents.

  6. 6.

    http://www.bibsonomy.org/

  7. 7.

    http://www.wikicfp.com/cfp/

  8. 8.

    https://gate.ac.uk/

  9. 9.

    The system described in this paper can be accessed on-line at: http://sempub.taln.upf.edu/eswc2014sempub/ (password: ceurrdf2014).

  10. 10.

    http://gate.ac.uk/sale/tao/splitch6.html

References

  1. Shotton, D.: Semantic publishing: the coming revolution in scientific journal publishing. Learn. Publ. 22(2), 85–94 (2009)

    Article  Google Scholar 

  2. Shotton, D., Portwin, K., Klyne, G., Miles, A.: Adventures in semantic publishing: exemplar semantic enhancements of a research article. PLoS Comput. Biol. 5(4), e1000361 (2009)

    Article  Google Scholar 

  3. Smit, E., Van Der Graaf, M.: Journal article mining: the scholarly publishers’ perspective. Learn. Publ. 25(1), 35–46 (2012)

    Article  Google Scholar 

  4. Bizer, C.: Linking data & publications expert report. Global Research Data Infrastructure of European Union (2012)

    Google Scholar 

  5. Ciancarini, P., Di Iorio, A., Nuzzolese, A.G., Peroni, S., Vitali, F.: Semantic annotation of scholarly documents and citations. In: Baldoni, M., Baroglio, C., Boella, G., Micalizio, R. (eds.) AI*IA 2013. LNCS, vol. 8249, pp. 336–347. Springer, Heidelberg (2013)

    Google Scholar 

  6. Attwood, T.K., Kell, D.B., McDermott, P., Marsh, J., Pettifer, S.R., Thorne, D.: Utopia documents: linking scholarly literature with research data. Bioinformatics 26(18), 568–574 (2010)

    Article  Google Scholar 

  7. Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: a framework and graphical development environment for robust NLP tools and applications. In: Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics, ACL (2002)

    Google Scholar 

  8. Li, Y., Bontcheva, K., Cunningham, H.: Adapting SVM for data sparseness and imbalance: a case study on information extraction. Nat. Lang. Eng. (Cambridge University Press) 15, 241–271 (2009)

    Google Scholar 

  9. Mendes, P.N., Jakob, M., Garca-Silva, A., Bizer, C.: DBpedia spotlight: shedding light on the web of documents. In: Proceedings of the 7th International Conference on Semantic Systems, pp. 1–8. ACM (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Francesco Ronzano .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Ronzano, F., del Bosque, G.C., Saggion, H. (2014). Semantify CEUR-WS Proceedings: Towards the Automatic Generation of Highly Descriptive Scholarly Publishing Linked Datasets. In: Presutti, V., et al. Semantic Web Evaluation Challenge. SemWebEval 2014. Communications in Computer and Information Science, vol 475. Springer, Cham. https://doi.org/10.1007/978-3-319-12024-9_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12024-9_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12023-2

  • Online ISBN: 978-3-319-12024-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics