Document Analysis group

Dipartimento di Ingegeria dell' Informazione
University of Florence
Via Santa Marta 3
50139 Firenze - Italy

Simone Marinai - Publications





Papers


[DLPR16]
S. Capobianco, S. Marinai.  Record Counting in Historical Handwritten Documents with Convolutional Neural Networks.  Int.l Workshop on Deep Learning for Pattern Recognition, 2016, http://arxiv.org/abs/1610.07393
[ICDAR15]
M. Zeshan Afzal, S. Capobianco, M. Imran Malik, S. Marinai, T. M. Breuel, A. Dengel, M. Liwicki. Deepdocclassifier: Document classification with deep Convolutional Neural Network.  13th International Conference on Document Analysis and Recognition, 2015, IEEE Press:pp. 1111-1115, 2015 doi:10.1109/ICDAR.2015.7333933
[WWW15]
C. Goncu, A. Madugalla, S. Marinai, K. Marriott, Accessible On-Line Floor Plans WWW 2015 pp. 388-398 doi:10.1145/2736277.2741660
[JCIMM14]
P. Frasconi, F. Gabbrielli, M. Lippi, S. Marinai, Markov Logic Networks for Optical Chemical Structure Recognition, Journal of Chemical Information and Modeling, Vol. 54, N. 28, pp. 2380-2390 (2014) doi:10.1021/ci5002197
[DIPR14]
S. Marinai, Page Similarity and Classification, In Handbook of Document Image Processing and Recognition, pp.223-253. Ed. D. Doermann, K. Tombre, Springer Verlag, 2014.
[DOCENG13]
ACM DL Author-ize serviceReflowing and annotating scientific papers on eBook readers
Simone Marinai
DocEng '13 Proceedings of the 2013 ACM symposium on Document engineering, 2013
[HIP13]
ACM DL Author-ize serviceContextual word spotting in historical manuscripts using Markov logic networks
David Fernández, Simone Marinai, Josep Lladós, Alicia Fornés
HIP '13 Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing, 2013
[DocEng12]
S. Marinai, S. Quiriconi, Displaying chemical structural formulae in ePub format.
[ICPR12]
A. Barducci, S. Marinai, Object Recognition in Floor Plans by Graphs of White Connected Components.
[HIP11]
ACM DL Author-ize serviceTowards a faithful visualization of historical books on e-book readers
Simone Marinai, Alessio Anzivino, Matteo Spampani
HIP '11 Proceedings of the 2011 Workshop on Historical Document Imaging and Processing, 2011
[LSSD11]
S. Marinai, B. Miotti, G. Soda, Digital Libraries and Document Image Retrieval Techniques: A Survey, In Learning Structure and Schemas from Documents, Ed. Marenglen Biba, Fatos Xhafa, Springer Verlag, 2011.
[ICDAR11a]
S. Marinai, E. Marino, G. Soda.  Conversion of PDF books in ePub format.  11th International Conference on Document Analysis and Recognition, Beijing (China), 2011, IEEE Press:pp. , 2011
[ICDAR11b]
S. Marinai, B. Miotti, G. Soda.  Using Earth Mover's Distance in the Bag-of-Visual-Words Model for Mathematical Symbol Retrieval.  11th International Conference on Document Analysis and Recognition, Beijing (China), 2011, IEEE Press:pp. , 2011
[IJDAR11a]
S. Marinai, Text retrieval from early printed books, International Journal on Document Analysis and Recognition, Vol. 14, N. 2, pp. 117-130 Springer-Verlag, Berlin (D) doi:10.1007/s10032-010-0146-0
[IJDAR11b]
S. Marinai, D. Karatzas, Report from the AND 2009 working group on noisy text datasets , International Journal on Document Analysis and Recognition, Vol. 14, N. 2, pp. 113-116 Springer-Verlag, Berlin (D) doi:10.1007/s10032-011-0159-3
[CCIS10]
S. Marinai, B. Miotti, G. Soda Mathematical Symbol Indexing for digital libraries. Communications in Computer and Information Science, 2010, Volume 91, Part 4, 113-124, DOI: 10.1007/978-3-642-15850-6_12
[DOCENG10]
ACM DL Author-ize serviceTable of contents recognition for converting PDF documents in e-book formats
Simone Marinai, Emanuele Marino, Giovanni Soda
DocEng '10 Proceedings of the 10th ACM symposium on Document engineering, 2010
[AIIA10]
S. Marinai, B. Miotti, G. Soda Mathematical Symbol Indexing AI*IA 2009: 102-111 doi:10.1007/978-3-642-10291-2_11
[ICPR10]
S. Marinai, B. Miotti, G. Soda Bag of Characters and SOM Clustering for Script Recognition and Writer Identification ICPR 2010: 2182-2185 doi:10.1109/ICPR.2010.534
[IRCDL10]
S. Marinai, B. Miotti, G. Soda Mathematical Symbol Indexing for Digital Libraries IRCDL 2010: 113-124 doi:10.1007/978-3-642-15850-6_12
[ICIAP09]
S. Marinai, E. Marino, G. Soda Nonlinear Embedded Map Projection for Dimensionality Reduction, Proc. ICIAP 09, Springer Verlag, 2009. doi:10.1007/978-3-642-04146-4_25
[ICDAR09a]
S. Marinai, Metadata Extraction from PDF Papers for Digital Library Ingest, Proc. ICDAR 2009, IEEE, pp. 251-255 2009. doi:10.1109/ICDAR.2009.232
[ICDAR09b]
S. Marinai, B. Miotti, G. Soda, Mathematical Symbol Indexing Using Topologically Ordered Clusters of Shape Contexts, Proc. ICDAR 2009, IEEE, pp. 1041-1045, 2009. doi:10.1109/ICDAR.2009.120
[AND09]
ACM DL Author-ize serviceText retrieval from early printed books
Simone Marinai
AND '09 Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data, 2009
An extended version of this paper has been published in [IJDAR11a]
[SPR08]
S. Marinai, E. Marino, G. Soda Embedded map projection for dimensionality reduction based similarity search, Proc. S+SSPR 2008, Springer Verlag, 2008. doi:10.1007/978-3-540-89689-0_62
[DAS08]
S. Marinai, E. Marino, G. Soda A comparison of clustering methods for word image indexing, Proc. DAS 2008, Springer Verlag, 2008.
[MLDAR08a]
S. Marinai, Introduction to Document Analysis and Recognition, In Machine Learning in Document Analysis and Recognition, Studies in Computational Intelligence 90, Ed. Simone Marinai, Hiromichi Fujisawa, Springer Verlag, 2008.
[MLDAR08b]
S. Marinai, E. Marino, G. Soda, Self-Organizing Maps for Clustering in Document Image Analysis, In Machine Learning in Document Analysis and Recognition, Studies in Computational Intelligence 90, Ed. Simone Marinai, Hiromichi Fujisawa, Springer Verlag, 2008.
[ICIAP07]
S. Marinai, E. Marino, G. Soda.  Transformation invariant SOM clustering in Document Image Analysis.  14th International Conference on Image Analysis and Processing, Modena (Italy), 2007, IEEE Press:pp. 185-190, 2007
[ECDL07]
S. Marinai, E. Marino, G. Soda.  Exploring Digital Libraries with Document Image Retrieval.  11th European Conference on Research and Advanced Technology for Digital Libraries, Budapest (Hungary), 2007, Springer Verlag, pp. 368-379.
[MYS07]
S. Marinai.  SOM clustering for text retrieval and classification with examples on Indian scripts.  Proc. of Brainstorming Workshop on OCR for Indian Languages 16-17 March, 2007, Mysore (India).
Invited talk
[PAMI06]
S. Marinai, M.Gori, G.Soda, Font Adaptive Word Indexing of Modern Printed Documents, IEEE Transaction PAMI, vol 28, N. 8, August 2006, pp. 1187-1199, IEEE Press, Los Alamitos (CA).
[CIFED06]
S. Marinai.  A survey of document image retrieval in digital libraries.  9th Colloque International Francophone sur l'Ecrit et le Document (CIFED 2006), pag. 193-198.
Invited talk
[DAS06]
S. Marinai, S. Faini, E. Marino, G. Soda.  Efficient word retrieval by means of SOM clustering and PCA.  7th International Workshop on Document Analysis Systems}, Nelson (New Zealand), 2006, LNCS: pp.
[DIAL06]
S. Marinai, E. Marino, G. Soda, Tree clustering for layout-based document image retrieval, Proceedings of the Second Int'l Workshop on Document Image Analysis for Libraries, pp. 243-251, Lyon (France), 2006, IEEE Press, Los Alamitos (CA).
[DAS06]
S. Marinai, S. Faini, E. Marino, G. Soda.  Efficient word retrieval by means of SOM clustering and PCA.  7th International Workshop on Document Analysis Systems}, Nelson (New Zealand), 2006, LNCS: pp.
[PAMI 05]
S. Marinai, M.Gori, G.Soda, Artificial Neural Networks for Document Analysis and Recognition, IEEE Transaction PAMI, vol 27, N. 1, January 2005, pp. 23-35, IEEE Press, Los Alamitos (CA).
[ICDAR05]
S. Marinai, E. Marino, G. Soda.  Layout based document image retrieval by means of XY tree reduction.  9th International Conference on Document Analysis and Recognition}, Seoul (Korea), 2005, IEEE Press:pp. 432-436, 2005
[NNLDAR05]
S. Faini, S. Marinai, E. Marino, G. Soda, SOM-based Document Image Retrieval, Proceeding of the 1st International IAPR Workshop on Neural Networks and Learning in Document Analysis and Recognition, pp. 33 -- 40, Seoul (Korea), 2005.
[AvivDlib05]
S. Marinai, E. Marino, G. Soda, Layout based document image retrieval in Digital Libraries, Proceeding of the 7th Int. Workshop Audio-Visual Content and Information Visualization in Digital Libraries (AVIVDiLib '05), Cortona (Italy), 2005 pp.67-76.
[DIAL04]
S. Marinai, E. Marino, F. Cesarini, G. Soda, A general system for the retrieval of document images from digital libraries, Proceedings of the First Int'l Workshop on Document Image Analysis for Libraries, pp. 150-173, Palo Alto (CA), 2004, IEEE Press, Los Alamitos (CA).
[PR03]
M. Gori, M. Maggini, S. Marinai, J. Q. Sheng, G. Soda, Edge-Backpropagation for Noisy Logo Recognition, Pattern Recognition, vol 36, N.1, 2003, pp. 103-110, Elsevier, Amsterdam (NL).
[ICDAR03a]
S. Marinai, E. Marino, G. Soda, Indexing and Retrieval of Words in Old Documents, Proceedings of ICDAR 2003, pp. 223-227, 2003, IEEE Press, Los Alamitos (CA).
This paper won the Best Paper Award at ICDAR 2003.
[ICDAR03b]
S. Baldi, S. Marinai, G. Soda, Using tree grammars for training set expansion in page classification, Proceedings of ICDAR 2003, pp. 829-833, 2003, IEEE Press, Los Alamitos (CA).
[IJDAR02]
E. Appiani, F. Cesarini, A.M. Colla, M. Diligenti, M.Gori, S.Marinai, G.Soda, Automatic document classification and indexing in high-volume applications, IJDAR, vol 4, N. 2 2001, pp. 69-83, Springer-Verlag, Berlin (D).
[ICPR02]
F. Cesarini, S. Marinai, L. Sarti, G. Soda, Trainable table location in document images, Proceedings of the 16th ICPR, pp. 236-240, Queb�c City (Canada), August 2002, IEEE Press, Los Alamitos (CA).
[DAS02]
F. Cesarini, S. Marinai, G. Soda, Retrieval by layout similarity of documents represented by MXY trees, Proceedings of the 5th IAPR International Workshop on Document Analysis Systems (DAS), pp. 353-364 Princeton (NJ, USA), August 2002, LNCS 2423, Springer-Verlag, Berlino (D).
[IJDAR01]
E. Francesconi, M.Gori, S.Marinai, G.Soda, A serial combination of connectionist-based classifiers for OCR, IJDAR, vol 3, N. 3 2001, pp. 160-168, Springer-Verlag, Berlin (D).
[ICDAR01]
F. Cesarini, M. Lastri, S. Marinai, G. Soda, Encoding of modified X-Y tress for document classification, Proceedings of ICDAR 2001, pp. 1131-1136, Seattle (USA), 2001, IEEE Press, Los Alamitos (CA).
[DEXA01]
F. Cesarini, M. Lastri, S. Marinai, G. Soda, Page classification for meta-data extraction from digital collections, Proceedings of DEXA 2001, Munich (D), 2001, pp. 82-91, LNCS 2113, Springer-Verlag, Berlin (D).
[ICDAR99a]
F.Cesarini, M. Gori, S. Marinai, G. Soda, Structured Document Segmentation and Representation by the Modified X-Y Tree, Proceedings of ICDAR 1999, pp. 563-566, Bangalore (India), 1999, IEEE Press, Los Alamitos (CA).
[ICDAR99b]
S. Marinai, P. Nesi, Projection based Segmentation of Musical Sheets, Proceedings of ICDAR 1999, pp. 563-566, Bangalore (India), 1999, IEEE Press, Los Alamitos (CA).
[PAMI98]
F.Cesarini, M.Gori, S.Marinai, G.Soda, INFORMys: a flexible INvoice-like FORM reader system, IEEE Transaction PAMI, vol 20, N. 7 July 1998, pp. 730-745, IEEE Press, Los Alamitos (CA).
[GREC97]
E.Francesconi, P.Frasconi, M. Gori, S. Marinai, J.Q. Sheng, G. Soda, A. Sperduti, Logo Recognition by Recursive Neural Networks, in Graphics Recognition, Algorithms and Systems, LNCS (1389) pp. 104 - 117, 1998, Springer Verlag, Berlino (D).
[DEXA97]
F.Cesarini, E.Francesconi, M.Gori, S.Marinai, J.Q.Sheng, G.Soda, Conceptual Modelling for Invoice Document Processing, Proceedings of the Conference DEXA '97 Workshop on Query Processing in Multimedia Information System, Toulose, September 1997, pp. 596-603, IEEE Press, Los Alamitos (CA).
[ICDAR97a]
F.Cesarini, E.Francesconi, M. Gori, S. Marinai, J.Q. Sheng, G. Soda, A Neural-based architecture for spot-noisy logo recognition, Proceedings of ICDAR 1997, pp. 175-179, Ulm (Germany), 1997, IEEE Press, Los Alamitos (CA).
[ICDAR97b]
F.Cesarini, E.Francesconi, M. Gori, S. Marinai, J.Q. Sheng, G. Soda, Rectangle labelling for an Invoice Understanding System, Proceedings of ICDAR 1997, pp. 324-330, Ulm (Germany), 1997, IEEE Press, Los Alamitos (CA).
[GREC97]
F.Cesarini, M.Gori, S.Marinai, G.Soda, A Hybrid System for Locating Low Level Graphic Items, in Graphics Recognition, Methods and Applications, LNCS (1072) pp. 135 - 147, 1996, Springer Verlag, Berlino (D).
[DEXA95]
F. Cesarini, M. Gori, S. Marinai, G. Soda, Data Extraction from Form Images, Proceedings of DEXA 1995, London (UK), 1995, pp. 438-448, LNCS 978, Springer-Verlag, Berlin (D).
[ICDAR95]
F.Cesarini, M. Gori, S. Marinai, G. Soda, A System for Data Extraction from Forms of Known Class, Proceedings of ICDAR 1995, pp. 1136-11409, Montreal, 1995, IEEE Press, Los Alamitos (CA).

Copyright notice

The documents listed in this site are provided as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

top

1st November 2007. Dante Group.