Carlos Castillo: Social Computing and Web Mining

I am an ICREA Research Professor at Universitat Pompeu Fabra, where I lead the Web Science and Social Computing Research Group. My goal is to address issues of social significance through computer science and interdisciplinary research.

See also: Google Scholar · DBLP · ORCID · Scopus

Publications

Start here: [Algorithmic Fairness] [Crisis Informatics]

Working Papers (In Preparation or Under Review)

Alexandra Olteanu, Michael Ekstrand, Carlos Castillo, Jina Suh. Responsible AI Research Needs Impact Statements Too. Pre-print [arxiv|blogpost]

Francielle Marques, Davinia Hernandez-Leo, Carlos Castillo. Measuring gender bias in student satisfaction in higher education: A cross-department study. Under review [request by mail]

David Solans, Andrea Beretta, Manuel Portela, Carlos Castillo, Anna Monreale. Human Response to an AI-Based Decision Support System: A User Study on the Effects of Accuracy and Bias. Working draft. [arxiv]

To appear in 2024

Journal papers

Published in 2023 (6)

Journal papers

Conference papers

Workshop papers / short papers / abstracts

Laura Casanovas i Buliart, Priscila Alvarez-Cueva, Carlos Castillo. From The Beatles to Bad Bunny: Sexism in popular music through an automated text analysis. Poster at IC2S2. Copenhagen, Denmark, 2023. [poster|code]

Lorenzo Porcaro, Carlos Castillo, Emilia Gomez, João Vinagre: Fairness and Diversity in Information Access Systems. To be presented at the European Workshop on Algorithmic Fairness (EWAF). [arxiv|ewaf]

Ioannis Bilionis, Luis Fernandez-Luque and Carlos Castillo: A Survey on Public Data Sets Related to Chronic Diseases. Short paper to appear in CBMS.

Published in 2022 (11)

Journal papers

Conference papers

Workshop papers / short papers / abstracts

Marilena Budan, Carlos Castillo: The Coverage of Sexual Violence in Spanish News Media. ICWSM Workshop on Data for the Wellbeing of the Most Vulnerable. [aaai|doi]

Valerio Lorini, Carlos Castillo: "SMDRM: A Platform to Analyze Social Media for Disaster Risk Management in Near Real Time". To be presented in the SOMMER workshop, co-located with ICWSM. [aaai|doi]

Lorenzo Porcaro, Emilia Gómez, Carlos Castillo. Diversity in the Music Listening Experience: Insights from Focus Group Interviews. Proc. of ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR), pp 272-276, ACM Press. [doi|arxiv]

Marzieh Karimi-Haghighi and Carlos Castillo: Quantitative analysis of disparate effects of RisCanvi for estimating the risk of violent recidivism. Technical Report, Web Science and Social Computing Research Group, Universitat Pompeu Fabra. October 2022.

Published in 2021 (6)

Journal papers

Conference papers

Workshop papers / short papers / abstracts

Marzieh Karimi-Haghighi, Carlos Castillo: Enhancing a recidivism prediction tool with machine learning: effectiveness and algorithmic fairness. ICAIL 2021 (Short papers), pp. 210-214. ACM Press. [doi].

Valerio Lorini, Carlos Castillo, Steve Peterson, Paola Rufolo, Hemant Purohit, Diego Pajarito, João Porto de Albuquerque, Cody Buntain: Social Media for Emergency Management: Opportunities and Challenges at the Intersection of Research and Practice. ISCRAM (WiP) 2021. [video]

Marzieh Karimi-Haghighi, Carlos Castillo, Davinia Hernandez-Leo, Veronica Moreno Oliver: Predicting Early Dropout: Calibration and Algorithmic Fairness Considerations. ADORE Workshop at LAK'21 [arxiv]

Valerio Lorini, Peter Salamon, Carlos Castillo: SMDRM - Social Media for Disaster Risk Management. EGU General Assembly 2021

Published in 2020 (8)

Journal papers

Conference papers

Workshop, short, and demo papers

Valerio Lorini, Carlos Castillo, Domenico Nappo, Francesco Dottori, and Peter Salamon: Social Media Alerts can Improve, but not Replace Hydrological Models for Forecasting Floods. Web Intelligence 2020 Short Papers, pp. 343-348. [arxiv]

Dougal Shakespeare, Lorenzo Porcaro, Emilia Gómez, Carlos Castillo: Exploring Artist Gender Bias in Music Recommendation. 2nd Workshop on the Impact of Recommender Systems., 2020.

Corinna Hertweck, Carlos Castillo, Michael Mathioudakis: Towards Data-Driven Affirmative Action Policies under Uncertainty. In Fairness, Accountability, and Transparency in Educational Data Cyberspace (FATED) Workshop. [arxiv]

Panayiotis Smeros, Carlos Castillo, Karl Aberer: SciLens News Platform: A System for Real-Time Evaluation of News Articles. In PVLDB 2020 Demos. [pvldb]

Meike Zehlike, Carlos Castillo: Reducing Disparate Exposure in Ranking: A Learning To Rank Approach. In WWW Short papers, Taipei, Taiwan. [arxiv]

Meike Zehlike, Tom Sühr, Carlos Castillo, Ivan Kitanovski: FairSearch: A Tool For Fairness in Ranked Search Results. To appear in WWW Demos, Taipei, Taiwan. [arxiv]

Proceedings

Mireille Hildebrandt, Carlos Castillo, Elisa Celis, Salvatore Ruggieri, Linnet Taylor, Gabriela Zanfir-Fortuna: Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency (FAT*2020). ACM Press, 2020. [www|doi]

Published in 2019 (6)

Journal papers

Conference papers

Short paper

Rahul Pandey, Carlos Castillo, and Hemant Purohit: Modeling Human Annotation Errors to Design Bias-Aware Systems for Social Stream Processing. In ASONAM (short papers). [doi|arxiv]

Workshop and work-in-progress papers

Fedor Vitiugin, Carlos Castillo: Comparison of Social Media in English and Russian During Emergencies. In ISCRAM. Valencia, Spain (WiPe). [slides]

Meike Zehlike, Carlos Castillo, Ivan Kitanovski. FairSearch: A Programming Library for Fair Search Results. At Data Science for Social Good Workshop. San Francisco, USA (Abstract).

Special issue

Yu-Ru Lin, Carlos Castillo, Jie Yin: Introduction to the Special Issue on AI for Disaster Management and Resilience. IEEE Intelligent Systems 34(3): 3-5 (2019).

Published in 2018 (4)

Conference papers

Research Report

Carlos Castillo: La oferta y disponibilidad de contenido audiovisual en la era de los datos masivos. Informe comisionado por el Consejo Audiovisual de Cataluña (CAC). Publicado en diciembre 2018. [versión en catalán|versión en castellano]

Workshop Paper, Short Paper, or Poster

Songül Tolan, Carlos Castillo, Marius Miron, Emilia Gómez: Expert assessment vs. machine learning algorithms: juvenile criminal recidivism in Catalonia. Presentation at the Algorithms and Society Workshop, Brussels, Belgium, December 2018. [slides]

Sofiane Abbar, Carlos Castillo and Antonio Sanfilippo: To Post or Not to Post: Using Online Trends to Predict Popularity of Offline Content. Short paper at Hypertext 2018. [acm|doi]

Oana Balalau, Carlos Castillo, Mauro Sozio: EviDense: a Graph-based Method for Finding Unique High-impact Events with Succinct Keyword-based Descriptions. Poster at ICWSM 2018. [data and code|version in arxiv]

Michael Mathioudakis, Carlos Castillo: Using STAN to Explore Fairness in University Admission Policies. Poster at STAN Conference 2018.

Keynote and Tutorial

Carlos Castillo: Fairness and Transparency in Ranking (Keynote at Data and Bias workshop at KDD). In SIGIR Forum, Vol. 52. No. 2, December 2018, pages 64-71. [slides|sigir forum|acm]

Alexandra Olteanu, Emre Kıcıman, Carlos Castillo, Fernando Diaz: A Critical Review of Online Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries. Tutorial at WSDM 2018, WWW 2018, SDM 2018. [doi]

Invited talks

Carlos Castillo: Big Crisis Data / Crisis Informatics. At: Russian Summer School on Information Retrieval (RuSSIR), Kazan, Russia, August 2018.

Carlos Castillo: "A Brief Overview of Sources and Manifestations of Bias When Working with Social Data." Summary of the tutorial based on the tutorial series on social data biases led by Alexandra Olteanu, for the Russian Summer School on Information Retrieval (RuSSIR). Kazan, Russia, August 2018.

Carlos Castillo: "Algorithmic Discrimination." Talk at BCN Analytics Data and Ethics event, Barcelona, April 2018. [youtube]

Workshop Proceedings

Yu-Ru Lin, Carlos Castillo, Jie Yin: The 5th International Workshop on Social Web for Disaster Management (SWDM'18): Collective Sensing, Trust, and Resilience in Global Crises. In Proc. WSDM 2018 [acm]

Published in 2017 (3)

Conference papers

Tutorial

Sara Hajian and Carlos Castillo: Discovering and Mitigating Algorithmic Discrimination. Tutorial at International Conference on Computational Social Science (IC2S2). Cologne, Germany. July 2017.

Alexandra Olteanu, Emre Kıcıman, Carlos Castillo, Fernando Diaz: A Critical Review of Online Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries. Tutorial at ICWSM 2016, KDD 2017.

Short paper and abstract

Michele Gentili, Sara Hajian and Carlos Castillo: A case study of anonymization of medical surveys. Short paper in Proceedings of Digital Health, pp. 77-81. London, UK, 2017. ACM [acm].

Dottori, F., Kalas M., Lorini V., Wania A., Pappenberger F., Salamon, P., Ramos M. H., Cloke, H. L., Castillo, C.: Satellites, tweets, forecasts: the future of flood disaster management?. European Geosciences Union General Assembly 2017. [EGU]

Technical report

Carlos Castillo, Francesco Fabbri, and Diego Saez-Trumper: Current Practices of Online Community Managers: A Report from Six Interviews. Technical Report, Eurecat, January 2017. [bibtex]

Invited talks

Carlos Castillo: From Discrimination Discovery to Fairness-Aware Data Mining. Invited talk at 3rd annual workshop of the Center for Semantic Web Research. Santiago, Chile. January 2017.

Carlos Castillo: Detecting Algorithmic Discrimination. Invited talk at EPFL. Lausanne, Switzerland. July 2017.

Published in 2016 (4)

Book

Journal articles

Conference paper

Short papers

Muhammad Imran, Sanjay Chawla, Carlos Castillo: A Robust Framework for Classifying Evolving Document Streams in an Expert-Machine-Crowd Setting. Short paper in Proc. of ICDM 2016. Dec 2016, Barcelona, Catalunya-Spain. [arxiv].

Muhammad Imran, Patrick Meier, Carlos Castillo, Andre Lesa and Manuel Garcia Herranz: Enabling Digital Health by Automatic Classification of Short Messages. Short paper in Proc. of ACM Digital Health 2016. [acm|new scientist]

Tutorials

Sara Hajian, Francesco Bonchi, Carlos Castillo: Algorithmic bias: from discrimination discovery to fairness-aware data mining. Tutorial at KDD 2016. [acm]. Slides: Parts I and II: discrimination discovery, Parts III and IV: fairness-aware data mining.
Video: Part I: Introduction and Context, Part II: Discrimination Discovery, Part III: Fairness-Aware Data Mining and Part IV: Challenges and Directions for Future Research.

Workshop and Special Issue

Carlos Castillo, Fernando Diaz, Yu-Ru Lin, and Jie Yin: The Fourth International Workshop on Social Web for Disaster Management (SWDM 2016). Co-located with CIKM in Indianapolis, US. [acm]

Symeon Papadopoulos, Kalina Bontcheva, Eva Jaho, Mihai Lupu, and Carlos Castillo: Overview of the Special Issue on Trust and Veracity of Information in Social Media. ACM Transactions on Information Systems (TOIS) 34 (3), 14. 2016 [doi]

Carlos Castillo: Detecting algorithmic discrimination. Keynote at Dutch-Belgian Information Retrieval Workshop (DIR). November 2016.

Published in 2015 (4)

Journal article

Conference articles

Workshop/Symposium

Irina Temnikova, Carlos Castillo, Sarah Vieweg: The Case for Readability of Crisis Communications in Social Media. SWDM 2015, 18-22 May in Florence, Italy. [acm|dataset]

Muhammad Imran, Carlos Castillo: Towards a Data-driven Approach to Identify Crisis-Related Topics in Social Media Streams. SWDM 2015, 18-22 May in Florence, Italy.

Irina Temnikova, Carlos Castillo, Sarah Vieweg: EMTerms 1.0: A Terminological Resource for Crisis Tweets. ISCRAM 2015, 24-27 May in Kristiansand, Norway. [data]

Talks

Carlos Castillo: "Big Crisis Data, an Open Invitation." Keynote at WebMedia 2015, Manaus, Brazil. [slides]

Carlos Castillo: "Social Media Mining and Retrieval". Tutorial at ESSIR 2015, Thessaloniki, Greece. [slides]

Daniela Iosub, David Laniado, Carlos Castillo, Mayo Fuster Morell and Andreas Kaltenbrunner: "Networked Emotions and Communication Styles in Online Collaboration". Plenary talk at IC2S2, 8-11 June in Helsinki, Finland. [video]

Carlos Castillo, Gianmarco De Francisci Morales, Marcelo Mendoza and Nasir Khan: "Automatic Analysis of Television News: Media, People, Framing and Bias". Parallel session talk accepted at IC2S2, 8-11 June in Helsinki, Finland.

Other

Aris Anagnostopoulos, Ioannis Chatzigiannakis, Carlos Castillo: Algorithmic Methods of Data Mining. Teaching Materials, Sapienza University of Rome, 2015.

Muhammad Imran, Ioanna Lykourentzou, Yannick Naudet and Carlos Castillo: Engineering Crowdsourced Stream Processing Systems. Technical report. [arxiv]

Aditi Gupta, Carlos Castillo, Ponnurangam Kumaraguru: "TweetCredCrisis: Real-time Assessment of Quality of Content Posted on Twitter during Crisis Events". Poster at the CERC-IIITD Security and Privacy Symposium 2015.

Published in 2014 (8)

Journal articles

Conference articles

Conference proceedings volume

Ben Carterette, Fernando Diaz, Carlos Castillo, Donald Metzler (Eds.): Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, WSDM 2014. New York, USA. ACM, Feb. 24-28, 2014.

Demos

Muhammad Imran, Carlos Castillo, Ji Lucas, Patrick Meier and Sarah Vieweg: AIDR: Artificial Intelligence for Disaster Response. In WWW 2014 demo [aidr.qcri.org|acm]. See also: talk at CrisisMappers conference (video).

Kiran Garimella and Carlos Castillo: FAST: Forecast and Analytics of Social Media and Traffic. In CSCW 2014 (demos). [fast.qcri.org|acm]

Symposium and workshop articles

Yelena Mejova, Amy X. Zhang. Nicholas Diakopoulos, Carlos Castillo: Controversy and Sentiments in Online News. Poster in Symposium on Computational Journalism. [arxiv]

Muhammad Imran, Carlos Castillo, Ji Lucas, Patrick Meier, Jakob Rogstadius: Coordinating Human and Machine Intelligence to Classify Microblog Communications in Crises. In ISCRAM 2014.

Muhammad Imran and Carlos Castillo: Volunteer-powered Automatic Classification of Social Media Messages for Public Health in AIDR. Public Health in the Digital Age workshop in WWW 2014.

Tutorial and talk

Carlos Castillo, Fernando Diaz, and Hemant Purohit: Leveraging Social Media and Web of Data to Assist Crisis Response Coordination. Tutorial at SDM, Philadelphia, PA, USA. April 2014.

Carlos Castillo: Crisis Computing: Finding Relevant and Credible Information in Social Media During Disasters. Keynote at Big Data Analytics. Delhi, India, December 2014. [slides]

Other

Carlos Castillo: Predicting the Future with Big Data. In Al Jazeera English / Opinion, series on Big Data. 1 March 2014.

Carlos Castillo: How Tweets and Algorithms Can Save Lives. In Al Jazeera English / Opinion, 5 December 2014.

Sarah Vieweg and Carlos Castillo: Combining Human and Machine Intelligence for Processing of Twitter Data During Mass Emergencies. STCSN e-letter vol. 2 no. 1.

Sandra Gonzalez-Bailon, Gianmarco De Francisci Morales, Marcelo Mendoza, Nasir Khan and Carlos Castillo: "Cable News Coverage and Online News Stories: A Large-Scale Comparison of Media Bias". Technical Report, 2014. [ssrn preprint]

Published in 2013 (7)

Monograph

Journal article

Conference articles

Conference article (short paper)

Diego Sáez-Trumper, Carlos Castillo and Mounia Lalmas: Social Media News Communities: Gatekeeping, Coverage, and Statement Bias (+ supplementary material). In CIKM 2013 (short paper) [acm|slides|mirror|bib|DATASET]

Tutorial

Hemant Purohit, Carlos Castillo, Patrick Meier and Amit Sheth: Crisis Mapping, Citizen Sensing and Social Media Analytics. Tutorial at ICWSM, May 2013.

Invited talks

Carlos Castillo: Social Media News Mining and Automatic Content Analysis of News. Invited talk at Tow Center, Columbia University. New York City, USA, 2013. [VIDEO|blogpost|invitation]

Carlos Castillo: News and Social Media. Keynote at the Social News on the Web (SNOW) workshop. Rio de Janeiro, Brazil, 2013. [acm|slides]

Workshop/symposium articles

Carlos Castillo, Gianmarco De Francisci Morales, Marcelo Mendoza, Nasir Khan: Says Who? Automatic Text-based Content Analysis of Television News. Workshop on Mining Unstructured Data Using NLP (UnstructureNLP) , co-located with CIKM. San Francisco, CA, 2013. [arxiv|acm]

Janette Lehmann, Carlos Castillo, Mounia Lalmas and Ethan Zuckerman: Finding News Curators in Twitter. Social News on the Web (SNOW) workshop. Rio de Janeiro, Brazil, 2013. [mirror|blogpost|slides|bib|acm]

Abdulfatai Popoola, Dmytro Krasnoshtan, Attila Toth, Victor Naroditskiy, Carlos Castillo, Patrick Meier and Iyad Rahwan: Information Verification during Natural Disasters. Social Web and Disaster Management (SWDM) workshop. Rio de Janeiro, Brazil, 2013. [slides|acm|veri.ly|new scientist (free reg) (local copy)|mit technology review|heise online|foreign policy|the national (uae)]

Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz and Patrick Meier: Practical Extraction of Disaster-Relevant Information from Social Media. Social Web and Disaster Management (SWDM) workshop. Rio de Janeiro, Brazil, 2013. [acm]

Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz and Patrick Meier: Extracting Information Nuggets from Disaster-Related Messages in Social Media. In ISCRAM. Baden-Baden, Germany, 2013. Best paper award (see "Practical Extraction ..." for a follow-up to this work). [slides].

Soudip Roy Chowdhury, Muhammad Imran, Rizwan Asghar, Sihem Amer-Yahia, and Carlos Castillo: "Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages". In ISCRAM. Baden-Baden, Germany, 2013.

Fuming Shih, Oshani Seneviratne, Daniela Miao, Ilaria Liccardi, Lalana Kagal, Evan Patton, Patrick Meier, Carlos Castillo: Democratizing Mobile App Development for Disaster Management. To be presented at the IJCAI Workshop on Semantic Cities. Beijing, China, 2013. [mit news|wired uk|homeland security news wire]

Posters/Demos

Carlos Castillo, Gianmarco De Francisci Morales and Ajay Shekhawat: "Online Matching of Web Content to Closed Captions in IntoNow". SIGIR Demos, 2013. [acm]

Sihem Amer-Yahia, Francesco Bonchi, Carlos Castillo, Esteban Feuerstein, Isabel Mendez-Diaz and Paula Zabala: "Complexity and Algorithms for Composite Retrieval". WWW posters, 2013 [see also extended version|acm].

Published in 2012 (2)

Conference articles

Symposium articles

David Laniado, Andreas Kaltenbrunner, Carlos Castillo, Mayo Fuster-Morell: Emotions and dialogue in a peer-production community: the case of Wikipedia. In WikiSym 2012. [slides|acm]

Robert West, Ingmar Weber, Carlos Castillo: Drawing a Data-Driven Portrait of Wikipedia Editors. In WikiSym 2012. [acm|slides]

Tutorials and invited talks

Carlos Castillo, Wei Chen, Laks V. S. Lakshmanan: Information and Influence Spread in Social Networks, KDD 2012 Tutorial. [slides: introduction, data and software, influence maximization, other issues]

Carlos Castillo: Mining Search Behavior and User-Generated Content. In EDBT 2012, Industrial track. [acm]

Poster

Robert West, Ingmar Weber, Carlos Castillo: A Data-Driven Sketch of Wikipedia Editors. WWW Posters, 2012 [photo|acm].

Published in 2011 (6)

Monograph

Journal articles

Conference articles

Workshop report

Carlos Castillo, Zoltán Gyöngyi, Adam Jatowt, Katsumi Tanaka: Joint WICOW/AIRWeb workshop on web quality (WebQuality 2011). WWW (Companion Volume), pp. 313-314, 2011. [acm]

Published in 2010 (8)

Book chapter

Journal articles

Conference articles

Workshop articles and talks

Dino Ienco, Francesco Bonchi, Carlos Castillo: "The Meme Ranking Problem: Maximizing Microblogging Virality". In SIASP workshop. Sydney, Australia. [ieee|bib]

Marcelo Mendoza, Barbara Poblete, Carlos Castillo: "Twitter Under Crisis: Can we trust what we RT?". In SOMA 2010: KDD Workshop on Social Media Analytics, Washington, DC. July 2010. [acm|bib|soma|VIDEO|wall street journal|scientific american]

Ranieri Baraglia, Carlos Castillo, Debora Donato, Franco Maria Nardini, Raffaele Perego and Fabrizio Silvestri: "The Effects of Time on Query Flow Graph-based Models for Query Suggestion". In proceedings of RIAO. Paris, France, 2010. [slides]

Carlos Castillo, Aristides Gionis, Ronny Lempel, Yoelle Maarek: "When no clicks are good news". Industry track, SIGIR 2010. Geneva, Switzerland. [slides|video (teaser)]

Encyclopedia Entry

Carlos Castillo and Ricardo Baeza-Yates: "Web Retrieval and Mining". In Encyclopedia of Library and Information Sciences, Third Edition. Taylor & Francis, pp.5615-5622, 2010. [bib|request by mail]

Course Materials (in Spanish)

Mari Carmen Marcos. Entrevista a Carlos Castillo [on line]. "Hipertext.net", núm. 8, 2010.

Published in 2009 (5)

Journal articles

Conference articles

Conference article (short paper)

Ranieri Baraglia, Carlos Castillo, Debora Donato, Franco Maria Nardini, Raffaele Perego, Fabrizio Silvestri: "Aging effects on Query Flow Graph for Query Suggestion" (short paper). In CIKM 2009, pp. 1947-1950. ACM Press. [bib|poster|acm]

Workshop articles

Paolo Boldi, Francesco Bonchi, Carlos Castillo, Debora Donato, Sebastiano Vigna: "Query Suggestions Using Query-Flow Graphs". Workshop on Web Search Click Data (WSCD), pp. 56-63, 2009. [acm|slides|bib]

Marcin Sydow, Francesco Bonchi, Carlos Castillo, Debora Donato: "Optimising Topical Query Decomposition". Workshop on Web Search Click Data (WSCD), pp. 43-47, 2009. [acm|slides|bib]

Talks

Video: Minería de logs de consulta (in Spanish). Universidad de Oviedo, 2009-05-27

Video: 'Análisis de enlaces y detección de spam en la Web (in Spanish). Universidad de Oviedo, 2009-05-28. Press Coverage @ La Nueva España

Query-log Mining. Universidade Federal de Minas Gerais, 2009-03-19

Published in 2008 (8)

Journal Articles

Conference Articles

Workshop articles

Carlos Castillo, Claudio Corsi, Debora Donato, Paolo Ferragina, Aristides Gionis: "Query log mining for detecting polysemy and spam". In Proc. of WebKDD, Las Vegas, USA, 2008. Springer. [bib]

Carlos Castillo, Claudio Corsi, Debora Donato, Paolo Ferragina and Aristides Gionis: "Query-log mining for detecting spam". Proceedings of AIRWeb 2008, pp. 17-20. Beijing, China. ACM Press. [bib|acm]

Jacob Abernethy, Olivier Chapelle and Carlos Castillo: "Webspam Identification Through Content and Hyperlinks". Proceedings of AIRWeb 2008, pp. 41-44. Beijing, China. [bib|acm]

Workshop/Project Report

Carlos Castillo, Kumar Chellapilla, Brian Davison, "AIRWeb'07 Workshop report". SIGIR Forum, June 2008, pp. 68-72. [bib|acm|sigirf]

Carlos Castillo, Kumar Chellapilla, Dennis Fetterly, "Fourth international workshop on Adversarial Information Retrieval on the Web (AIRWeb 2008)". In WWW Workshops, April 2008. [bib|acm]

Luca Becchetti, Carlos Castillo, Debora Donato, Stefano Leonardi and Ricardo Baeza-Yates: "Web spam detection: Link-based and content-based techniques". In Friedhelm Meyer (Ed.), The European Integrated Project Dynamically Evolving, Large Scale Information Systems (DELIS): proceedings of the final workshop, pp. 99-113. Heinz-Nixdorf Institut, Universität Paderborn. [bib]

Poster

Antti Ukkonen, Carlos Castillo, Debora Donato, Aristides Gionis: "Searching the Wikipedia with contextual information". Proceedings of CIKM, pp. 1351-1352. Napa Valley, CA, USA, October 2008. ACM Press. [bib|acm]

Book Chapter

Marcin Sydow, Jakub Piskorski, Dawid Weiss, Carlos Castillo: "Fighting Web Spam". In F. Fogelman-Soulié et al. (eds.): Mining Massive Data Sets for Security, Vol. 19 of NATO SPSS Series D., pp. 134-153. IOS Press, 2008. [VIDEO|bib|request by mail]

Invited Column

Carlos Castillo, Yiyu Yao: "EvalWare: Granular Computing for Web Applications". IEEE Signal Processing Magazine, Vol. 25, No. 2, pp. 142-143, March 2008. [ieee|bib]

Published in 2007 (6)

Journal Articles

Conference Articles

Workshop Articles

Josiane-Xavier Parreira, Debora Donato, Carlos Castillo, Gerhard Weikum: "Computing Trusted Authority Scores in Peer-to-Peer Networks". Workshop on Adversarial Information Retrieval on the Web (AIRWeb), pp. 73-80. Banff, Canada. 2007. [bib|y!|airweb|acm]

Debora Donato, Mario Paniccia, Maddalena Selis, Carlos Castillo, Giovanni Cortese, Stefano Leonardi: "New Metrics for Reputation Management in P2P Networks". Workshop on Adversarial Information Retrieval on the Web (AIRWeb), pp. 65-72. Banff, Canada. 2007. [bib|y!|airweb|acm]

Invited Paper

Ricardo Baeza-Yates, Carlos Castillo, Flavio Junqueira, Vassilis Plachouras, Fabrizio Silvestri: "Challenges on Distributed Information Retrieval" (Invited Paper). International Conference on Data Engeneering (ICDE). Istanbul, Turkey, April 2007. IEEE CS Press. [bib|talk|y!|ieee]

Workshop Proceedings

Carlos Castillo, Kumar Chellapilla, Brian D. Davison (chairs/editors): "Proceedings of the 3rd international workshop on Adversarial information retrieval on the web". ACM ICPS, Vol. 215. 2007. [bib|acm]

National Journal

Carlos Castillo, Bartlomiej Starosta, Marcin Sydow "Crawl.pl: Measuring Statistical and Structural Properties of the Polish Web", Studia Informatica, 1(8), pp. 43-73, PL ISSN : 1731-2264, Academy of Podlasie Press, 2007. [bib]

Regional Conference

Gabriel H. Tolosa, Fernando R. A. Bordignon, Ricardo Baeza-Yates, Carlos Castillo: "Caracterización del Espacio Web de Argentina" (in spanish). To be presented in CLEI. Costa Rica, 2007.

Published in 2006 (7)

Journal Articles

Conference Articles

Encyclopedic Article

  • Ricardo Baeza-Yates and Carlos Castillo: "Web Searching". In Keith Brown, (Editor-in-Chief), Encyclopedia of Language and Linguistics, Second Edition, Vol. 13, pp. 527-537. Oxford: Elsevier, 2006.

Workshop Articles

Luca Becchetti, Carlos Castillo, Debora Donato and Adriano Fazzone: "A Comparison of Sampling Techniques for Web Characterization". In Proceedings of the Workshop on Link Analysis (LinkKDD). Philadelphia, USA, August 2006. ACM Press. [bib|linkkdd]

Luca Becchetti, Carlos Castillo, Debora Donato, Stefano Leonardi, Ricardo Baeza-Yates: "Using Rank Propagation and Probabilistic Counting for Link-Based Spam Detection". In Proceedings of the Workshop on Web Mining and Web Usage Analysis (WebKDD). Philadelphia, USA, August 2006. ACM Press. [bib|webkdd|acm|VIDEO] (See also DELIS TR-0341).

Luca Becchetti, Carlos Castillo, Debora Donato, Stefano Leonardi, Ricardo Baeza-Yates: "Link-Based Characterization and Detection of Web Spam". Workshop on Adversarial Information Retrieval on the Web (AIRWeb). Seattle, USA, August 2006. [bib|airweb|talk@bcn]

Gemma Boleda, Stefan Bott, Carlos Castillo, Rodrigo Meza, Toni Badia, Vicente López: "CUCWeb: a Catalan corpus built from the Web". 2nd Workshop on the Web as a Corpus at EACL'06. Trento, Italy, April 2006. [bib|eacl]

Newsletter

Carlos Castillo, Debora Donato, Luca Becchetti, Paolo Boldi, Massimo Santini, Sebastiano Vigna: "A Reference Collection for Web Spam". SIGIR Forum, Vol. 40, No. 2, December 2006. [dataset|www|sigirf|bib|y!|acm]. DELIS technical report DELIS-TR-0405.

Posters

Luca Becchetti and Carlos Castillo: "The Distribution of PageRank Follows a Power-Law only for Particular Values of the Damping Factor". World Wide Web Conference (posters), pp. 941-942. Edinburgh, Scotland, May 2006. [www2006|acm]

Ricardo Baeza-Yates and Carlos Castillo: "Relationship between Links and Trade". World Wide Web Conference (posters), pp. 927-928. Edinburgh, Scotland, May 2006. [delis-tr-0253|www2006|acm]

Patrizia Andronico, Marina Buzzi, Carlos Castillo and Barbara Leporini: "Testing Google Interfaces Modified for the Blind". World Wide Web Conference (posters), pp. 873-874. Edinburgh, Scotland, May 2006. [www2006|acm]

Published in 2005 (2)

Journal Article

Conference Article

Workshop Articles

Ricardo Baeza-Yates, Carlos Castillo and Vicente López: "Pagerank Increase under Different Collusion Topologies". Workshop on Adversarial Information Retrieval on the Web (AIRWeb). Chiba, Japan, 2005. [airweb|talk|bib]

Ricardo Baeza-Yates and Carlos Castillo: "Link Analysis in National Web Domains". Workshop on Open Source Web Information Retrieval (OSWIR), pp. 15-18. Compiegne, France, September 2005. [bibtex|oswir|talk] (extended in "Characterization of National Web Domains" 2006)

Carlos Castillo and Ricardo Baeza-Yates: "WIRE: an Open-Source Web Information Retrieval Environment". Workshop on Open Source Web Information Retrieval (OSWIR), pp. 27-30. Compiegne, France, September 2005 . [bib|oswir|website|talk]

Albert Bifet, Carlos Castillo, Paul-Alexandru Chirita and Ingmar Weber: "An Analysis of Factors Used in a Search Engine's Ranking". Workshop on Adversarial Information Retrieval on the Web (AIRWeb), synopsis. Chiba, Japan, 2005. [bib]. Reprinted in 2007 as a chapter of the book "Internet Search Engines -- An Introduction" edited by Ravi Kumar Jain B.; Chapter 5, pp. 76-95, ICFAI University Press.

National Conference

Marco Modesto, Álvaro R. Pereira Jr., Nivio Ziviani, Carlos Castillo and Ricardo Baeza-Yates: "Un Novo Retrato da Web Brasileira" (in portuguese) , SEMISH Symposium, pp. 2005-2017. São Leopoldo, Brazil. July 2005. [bib]

Abstract

Carlos Castillo: "Effective Web Crawling (Doctoral Abstract)". ACM SIGIR Forum Vol.39 No. 1, pp. 55-56. June 2005. [acm]

Technical Reports

Carlos Castillo and Ricardo Baeza-Yates: "Practical Web Crawling". Technical Report, 2005.

Carlos Castillo and Ricardo Baeza-Yates: "Visualizing the European Trade Graph". Technical re port DELIS-TR-0252, DELIS (Dynamically Evolving Large-scale Information Systems), 2005. [delis]

Ricardo Baeza-Yates, Paolo Boldi and Carlos Castillo: "The Choice of a Damping Factor for Propagating Importance in Link-Based Ranking". Technical report RI-DSI N. 305-05 , Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano, September 2005. [bib|unimi|talk@pisa] (reviewed and published in 2006 in SIGIR)

Ricardo Baeza-Yates and Carlos Castillo: "Caracterización de la Web Chilena" (in spanish). Technical report, Center for Web Research, Universidad de Chile, 2005. [website]

Patrizia Andronico, Marina Buzzi, Carlos Castillo and Barbara Leporini: "Search Engine UIs: remote usability test with blind persons". Technical report TR-15/2005, Istituto di Informatica e Telematica (IIT), Consiglio Nazionale delle Ricerche (CNR). Pisa, Italy, 2005. [request by e-mail]

Published in 2004 (5)

Book Chapter

Journal Article

Conferences and Workshops with Proceedings

National Conferences

G. Boleda, S. Bott, B. Poblete, C. Castillo, M.E. Fuenmayor, T. Badia, V. López: "CuCWeb, un corpus del català construït a partir de la web" (in catalan). Congrés Societat del Coneixement. Barcelona, España, 2004. [html]

Poster

Efthimis N. Efthimiadis, Carlos Castillo: "Charting the Greek Web". ASIST Conference (Poster), Providence, Rhode Island, USA, 2004. [bibtex]

Thesis

Carlos Castillo: "Efficient Web Crawling". PhD Thesis. Universidad de Chile, 2004. [bib]

Technical Reports

Ricardo Baeza-Yates, Felipe Lalanne, Carlos Castillo, Georges Dupret: "Comparing the characteristics of the Korean and the Chilean Web". Technical report, ITCC, DCC, University of Chile, 2004.

Ricardo Baeza-Yates, Carlos Castillo and Efthimis Efthimiadis: "Comparing the characteristics of the Chilean and the Greek Web". Technical report, Universidad de Chile, 2004.

Published in 2003 (1)

Conferences

  • A. Jaimes, J. Ruiz-del-Solar, R. Verschae, D. Yaksic, R. Baeza-Yates, E. Davis and C. Castillo: "On the Image Content of the Chilean Web". Latin American Web Conference (LA-WEB), IEEE Cs. Press, pp.72-83. Santiago, Chile, 2003. [bib|ieee]

Poster

Carlos Castillo: "Cooperation schemes between a Web server and a Web search engine". Latin American Web Conference LA-WEB (Extended Poster), IEEE Cs. Press, pp. 31a-35a. Santiago, Chile, 2003. [bib|ieee]

Technical Reports

Vicente López, Carlos Castillo and Joan Codina: "Information Retrieval in Mail Archives". Technical report, Cátedra Telefónica de Producción Multimedia, Universitat Pompeu Fabra, 2003.

Carlos Castillo: "Estudio de idiomas en las páginas Web españolas (dominio .ES)" (in spanish).Technical report, Cátedra Telefónica de Producción Multimedia, Universitat Pmpeu Fabra, 2003.

Published in 2002 (3)

Journal Article

Conferences

Poster

Carlos Castillo and Ricardo Baeza-Yates: "A New Model for Web Crawling". World Wide Web Conference (Poster). Honololulu, USA, 2002. [bib]

Published in 2001 (1)

Conference

National Conference

Carlos Castillo: "Newtenberg: Un Modelo e Implementación de un sistema de Publicaciones Digitales en la Web" (in spanish). Encuentro Chileno de Ciencias de la Computación. Punta Arenas, Chile. 2001.

Poster

Ricardo Baeza-Yates and Carlos Castillo: "Relating Web Structure and User Behavior". World Wide Web Conference (Poster). Hong Kong, 2001.

Technical Report

Ricardo Baeza-Yates and Carlos Castillo: "Analysis of Link-Based Ranking for the Web". Technical report, University of Chile, 2001.

Published in 2000

National Conference

Ricardo Baeza-Yates and Carlos Castillo: "Caracterizando la Web Chilena". (in spanish) Encuentro Chileno de Ciencias de la Computación, año 2000. [poster|bib]

Thesis

Carlos Castillo: "Características de la Web Chilena y Extensiones a un Buscador Web" (in spanish), Memoria de título, Universidad de Chile, año 2000.


See also: Google Scholar - DBLP - Microsoft Academic Search - ACM - CiteULike - DBLife - CSB - Arnet Miner - arXiv - ORCID - PubZone - Publons - WorldCat - US Library of Congress

Notes: (i) articles published by ACM/IEEE/Springer are the author's version, and can be downloaded from this page for personal use, but not posted in other web sites or mailing lists (ii) the numbers in parenthesis are the number of peer-reviewed works published in international journals or top-tier conferences with proceedings (iii) key papers are in boldface.