Tirthankar Ghosal

Tirthankar Ghosal is a researcher with the Institute of Formal and Applied Linguistics, Charles University, Czech Republic. He received his Ph.D. from the Indian Institute of Technology Patna. His main research interests are NLP/ML for Scientific Discourse Processing and Peer Reviews, Text/Dialogue Summarization, and Argumentation Mining. He is responsible for the Automatic Minuting module of ELITR. He is also the principal organizer of the Automatic Minuting community events for ELITR: SummDial at SIGDial 2021 and the AutoMin shared task at Interspeech 2021.


Address👉Pokaijote, P.O. Champasari, Siliguri, West Bengal, India – 734003
Email👉<first-name>.slg@gmail.com, <last-name>@ufal.mff.cuni.cz
Linkedin👉www.linkedin.com/in/tirthankar-ghosal-ai/
Twitter👉@TirthankarSlg

RESEARCH INTERESTS

  • Scholarly Language Processing
    • Natural Language Processing
    • Machine Learning and Deep Learning
    • Publication Mining
    • Argumentation Mining
  • Information Extraction and Retrieval
  • Text and Speech Processing, Meeting/Dialogue Summarization
  • Meta Science (Meta Research, Science of Science)
  • Scientometrics
    • Bibliometric Intelligence
  • Knowledge Discovery
    • Graph Mining, Knowledge Graphs
  • Artificial Intelligence

PROFESSIONAL EMPLOYMENT

Researcher
Charles University, Czech Republic
January 2021 – present
Visvesvaraya Research Fellow
Indian Institute of Technology Patna, India
January 2016 – December 2020
Research Intern
Oak Ridge National Laboratory, US
March 2019 – September 2019
Assistant Professor
Sikkim Manipal Institute of Technology, India
August 2012 – December 2015

EDUCATION

Ph.D. (Computer Science and Engineering)
Indian Institute of Technology Patna
Advisors: Dr. Asif Ekbal, Prof. Pushpak Bhattacharyya
January 2016 – August 2021
Master of Technology (Computer Science and Engineering)
Sikkim Manipal Institute of Technology
August 2012 – June 2015
Master of Computer Applications
University of North Bengal
July 2009 – June 2012
Bachelor of Computer Applications
University of North Bengal
June 2006 – June 2009

ACTIVE RESEARCH PROBLEMS

  1. Textual Novelty Detection (Scientific+News Domain)
  2. Significance of Peer Reviews and related problems in Scholarly Communications
  3. Measuring Research Pervasiveness via Citation Mining and Scholarly Full Text
  4. (Scientific) Misinformation Detection
  5. Entity-linking and Knowledge Discovery from Scientific Publications
  6. Text/Speech/Dialogue/Meeting Summarization and Automatic Minuting
  7. Knowledge Graphs on Scholarly Publications, Electronic Health Records, Radiology Reports, Medical Imaging
  8. Diversity and Inclusion in NLP
  9. Argumentation Mining in Scholarly and Legal Discourse

PUBLISHED PAPERS

[P1] Ghosal, T., Das, S.K., & Bhattacharjee, S. (2015). Sentiment analysis on (Bengali horoscope) corpus. 2015 Annual IEEE India Conference (INDICON), 1-6.

[P2] Saikh, T., Ghosal, T., Ekbal, A., & Bhattacharyya, P. (2017, December). Document-level novelty detection: textual entailment lends a helping hand. In Proceedings of the 14th International Conference on Natural Language Processing (ICON-2017) (pp. 131-140). [Only ACL recognized conference in India]

[P3] Ghosal, T., Salam, A., Tiwari, S., Ekbal, A., & Bhattacharyya, P. (2018). TAP-DLND 1.0: A Corpus for Document Level Novelty Detection. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018. [H-index 45]

[P4] Ghosal, T., Sonam, R., Saha, S., Ekbal, A., & Bhattacharyya, P. (2018). Investigating domain features for scope detection and classification of scientific articles. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (pp. 7-12).

[P5] Ghosal, T., Verma, R.K., Ekbal, A., Saha, S., Bhattacharyya, P., Chivukula, S.S., Tsatsaronis, G., Coupet, P., & Gregory, M.L. (2018). Can your paper evade the editor’s axe? Towards an AI-assisted peer review system.

[P6] Ghosal, T. (2018). Exploring the Implications of Artificial Intelligence in Various Aspects of Scholarly Peer Review. Bulletin of the IEEE Technical Committee on Digital Libraries. In the Doctoral Consortium of the 18th ACM/IEEE Joint Conference on Digital Libraries. [Core Rank A*]

[P7] Ghosal, T., Verma, R., Ekbal, A., Saha, S., & Bhattacharyya, P. (2018, May). Investigating Impact Features in Editorial Pre-Screening of Research Papers. In Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries (pp. 333-334). ACM. [Core Rank A*]

[P8] Ghosal, T., Edithal, V., Ekbal, A., Bhattacharyya, P., Tsatsaronis, G., & Chivukula, S. S. S. K. (2018, August). Novelty Goes Deep. A Deep Neural Solution To Document Level Novelty Detection. In Proceedings of the 27th International Conference on Computational Linguistics (pp. 2802-2813). [Core Rank A, H-index 41]

[P9] Ghosal, T., Shukla, A., Ekbal, A., & Bhattacharyya, P. (2019, July). To Comprehend the New: On Measuring the Freshness of a Document. In the 2019 International Joint Conference on Neural Networks (IJCNN) (pp. 1-8). IEEE. [Core Rank A, H-Index 36]

[P10] Ghosal, T., Sonam, R., Ekbal, A., Saha, S., & Bhattacharyya, P. (2019, June). Is the Paper Within Scope? Are You Fishing in the Right Pond?. In 2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (pp. 237-240). IEEE. [Core Rank A*]

[P11] Ghosal, T., Raj, A., Ekbal, A., Saha, S., & Bhattacharyya, P. (2019, June). A Deep Multimodal Investigation To Determine the Appropriateness of Scholarly Submissions. In 2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (pp. 227-236). IEEE. [Core Rank A*]

[P12] Ghosal, T., Dey, D., Dutta, A., Ekbal, A., Saha, S., & Bhattacharyya, P. (2019, June). A Multiview Clustering Approach To Identify Out-of-Scope Submissions in Peer Review. In 2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (pp. 392-393). IEEE. [Core Rank A*]

[P13] Ghosal, T., Chakraborty, A., Sonam, R., Ekbal, A., Saha, S., & Bhattacharyya, P. (2019, June). Incorporating Full Text and Bibliographic Features to Improve Scholarly Journal Recommendation. In 2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (pp. 374-375). IEEE. [Core Rank A*]

[P14] Ghosal, T., Verma, R., Ekbal, A., & Bhattacharyya, P. (2019, June). A Sentiment Augmented Deep Architecture to Predict Peer Review Outcomes. In 2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (pp. 414-415). IEEE. [Core Rank A*]

[P15] Ghosal, T., Verma, R., Ekbal, A., & Bhattacharyya, P. (2019, July). DeepSentiPeer: Harnessing Sentiment in Review Texts to Recommend Peer Review Decisions. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (pp. 1120-1130). [H-Index: 105, Core Rank A*]

[P16] Ghosal, T., Edithal, V., Ekbal, A., Bhattacharyya, P., Chivukula, S. S. S. K., & Tsatsaronis, G. (March, 2020). Is your document novel? Let attention guide you. An attention-based model for document-level novelty detection. Natural Language Engineering, 1-28. [H-Index 20, IF: 1.13]

[P17] Ghosal, T., Verma, R., Ekbal, A., Saha, S., & Bhattacharyya, P. (2020, November). What is the Scope of your Paper? An Experimental Study of Scope Detection Across Two Computer Science Journals. In Proceedings of the 22nd International Conference on Asia-Pacific Digital Libraries. [Core Rank A]

[P18] Muthu Kumar Chandrasekaran, Anita de Waard, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Eduard Hovy, Petr Knoth, David Konopnicki, Philipp Mayr, Robert M Patton, Michal Shmueli-Scheuer (2020) Overview of the First Workshop on Scholarly Document Processing In Conference on Empirical Methods in Natural Language Processing (EMNLP 2020).

[P19] Ghosal, Tirthankar. 2020. Towards Computational Analysis of Peer Reviews (Extended Abstract). In the Workshop on Informetrics and Scientometrics Research (SIG/MET).

[P20] Ghosal, Tirthankar. 2020. Towards Establishing a Research Lineage via Identification of Significant Citations (Extended Abstract). In the Workshop on Informetrics and Scientometrics Research (SIG/MET).

[P21] Ghosal, T., Edithal, V., & Ekbal, A., Finding Newness: Leveraging Memory Networks to Detect Document Novelty (Abstract). In West Coast NLP (WeCNLP) 2020.

[P22] Ghosal, T., Biswas, T., & Ekbal, A., Leveraging Multi-Premise Inference for Document-Level Novelty Detection (Abstract). In West Coast NLP (WeCNLP) 2020.

[P23] Kaushik, V.K., Ghosal, T., & Kordoni, V. Additional Context Helps! Leveraging Cited Paper Information To Improve Citation Classification. In Proceedings of ISSI 2021.

[P24] Hardik Arora, Tirthankar Ghosal, Sandeep Kumar, Suraj Patwal, and Phil Gooch. INNOVATORS at SemEval-2021 Task-11 NLP Contribution Graph Challenge: A Dependency Parsing and BERT-based model for Extracting Contribution Knowledge from Scientific Papers. (To Appear) In Proceedings of SemEval 2021 at ACL-IJCNLP 2021

[P25] Rina Kumari, Nischal Ashok, Tirthankar Ghosal, and Asif Ekbal. A Multitask Learning Approach for Fake News Detection: Novelty, Emotion, and Sentiment Lend a Helping Hand. (To Appear) In Proceedings of International Joint Conference on Neural Networks (IJCNN) 2021

[P26] Khalid Al-Khatib, Tirthankar Ghosal, Anita de Waard, Dayne Freitag, and Yufang Hou. Argument Mining for Scholarly Document Processing: Taking Stock and Looking Ahead (To Appear) In Proceedings of The Second Workshop on Scholarly Document Processing at NAACL 2021

[P27] Rina Kumari, Nischal Ashok, Tirthankar Ghosal, and Asif Ekbal. Misinformation detection using multi-task learning with mutual learning for novelty detection and emotion recognition in Information Processing and Management Journal (Elsevier), 2021

[P28] Ghosal, T., & Singh, M. Tracing Idea Propagation in a Scholarly Network via Identification of Meaningful Citations (To Appear) in the 84th Annual Meeting of the Association for Information Science and Technology (ASIS&T), 2021 (Also at WiNLP 2021 as an extended abstract)

[P29] Iz Beltagy, Arman Cohan, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Keith Hall, Drahomira Herrmannova, Petr Knoth, Kyle Lo, Philipp Mayr, Robert M. Patton, Michal Shmueli-Scheuer, Anita de Waard, Kuansan Wang, Lucy Lu Wang (2021) Overview of the Second Workshop on Scholarly Document Processing In NAACL 2021.

[P30] Kaushik, V.K., Ghosal, T., Tiwari, P. & Singh, M. IITP@3C 2021: Citation Classification (Task A) and Citation Significance Detection (Task B) In Proceedings of The Second Workshop on Scholarly Document Processing at NAACL 2021

[P31] Kumar, A., Ghosal, T. & Ekbal, A. A Deep Neural Architecture for Decision-Aware Meta-Review Generation (To Appear) In Proceedings of The ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2021

[P32] Kumar, S., Ghosal, T., Bharti, P. & Ekbal, A. (2021) Sharing is Caring! Joint Multitask Learning Helps Aspect-Category Extraction and Sentiment Detection in Scientific Peer Reviews In Proceedings of The ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2021

[P33] Guneet Singh Kohli, Prabsimran Kaur, Muskaan Singh, Tirthankar Ghosal and Prashant Rana (2021) ARGUABLY @ AI Debater-NLPCC 2021 Task 3: Argument Pair Extraction from Peer Review and Rebuttals In Proceedings of NLPCC-Evaluation 2021 (To Appear)

[P34] Rina Kumari, Nischal Ashok, Tirthankar Ghosal, and Asif Ekbal. What the fake? Probing Misinformation Detection Standing on the Shoulder of Novelty and Emotion in Information Processing and Management Journal (Elsevier), 2021

[P35] Ghosal, T., Tiwary, P., Patton, R., & Stahl, C. Towards Establishing a Research Lineage via Identification of Meaningful Citations in Quantitative Science Studies (MIT), 2021

[P36] Sandeep Kumar, Tirthankar Ghosal, & Asif Ekbal. DataQuest: An Approach To Automatically Extract Dataset Mentions From Scientific Papers In Proceedings of the 23rd International Conference on Asia-Pacific Digital Libraries (ICADL 2021) (To Appear).

[P37] Nishith Kotak, Anil K. Roy, Sourish Dasgupta, and Tirthankar Ghosal. A Consistency Analysis of NLP Approaches for Reviewer-Manuscript Matching In Proceedings of the 23rd International Conference on Asia-Pacific Digital Libraries (ICADL 2021) (To Appear).

[P38] Komal Gupta, Ammaar Ahmad, Tirthankar Ghosal, and Asif Ekbal. ContriSci: A BERT-based Multitasking Deep Neural Architecture to Identify Contribution Statements from Research Papers In Proceedings of the 23rd International Conference on Asia-Pacific Digital Libraries (ICADL 2021) (To Appear) (Also at WiNLP 2021 as an extended abstract)

[P39] Prabhat Kumar Bharti, Shashi Ranjan, Tirthankar Ghosal, Asif Ekbal, and Mayank Agarwal. PEERAssist: Leveraging on Paper-Review Interactions To Predict Peer Review Decisions In Proceedings of the 23rd International Conference on Asia-Pacific Digital Libraries (ICADL 2021) (To Appear).

[P40] Rajeev Verma, Kartik Shinde, Hardik Arora and Tirthankar Ghosal. Attend To Your Review: A Deep Neural Network to Extract Aspects from Peer Reviews In Proceedings of the 28th International Conference on Neural Information Processing (ICONIP 2021) (To Appear).

[P41] Komal Gupta, Tirthankar Ghosal, and Asif Ekbal. A Neuro-Symbolic Approach for Question Answering on Research Articles In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation (PACLIC 35) (To Appear).

[P42] Muskaan Singh, Tirthankar Ghosal, and Ondrej Bojar. An Empirical Performance Analysis of State-of-the-Art Summarization Models for Automatic Minuting In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation (PACLIC 35) (To Appear).

[P43] Tirthankar Ghosal, Sandeep Kumar, Prabhat Kumar Bharti, and Asif Ekbal. Peer Review Analyze: A Novel Benchmark Resource for Computational Analysis of Peer Reviews. In Plos One, 2021 (To appear)


COMMUNICATED/IN-PROGRESS WORKS (2021)

[P44] Memory Networks for Document-Level Novelty Detection. [In-Progress]

[P45] Which reviews are important? Investigations on Significance of Peer Reviews. [Communicated, 2021]

[P46] A novelty-focussed multimodal investigation on misinformation [In Progress]

[P47] A Survey on Automatic Minuting. [In-Progress]

[P48] Can we do away with the human evaluation of meeting summaries? Not yet! [In Progress]

[P49] A Multi-tasking Deep Neural Architecture for Citation Intent Classification. [Communicated, 2021]

[P50] Textual Novelty Detection in Community Question Answering Forums. [In-Progress]

[P51] A Longitudinal Survey on Semantic Similarity Methods [Communicated, 2021]

[P52] A Longitudinal Survey on Document-Level Novelty Detection [In-Progress]

[P53] A Large Scale Dataset for Question Answering on Scientific Literature. [Communicated, 2020]

[P54] A Multimodal Dataset for Fake News Detection [Communicated, 2021]

[P55] Empirical Studies on COVID-19 Misinformation [Communicated, 2021]

[P56] Aspect Extraction from Peer Reviews [In Progress].

[P57] Textual Novelty Detection: An NLP Perspective. [Under Revision, 2020]

[P58] A Pipeline method for Peer Review Decision Prediction and Meta Review Generation [In Progress]

[P59] Can you Estimate the Reviewer Knowledge? [In Progress]

[P60] Aspect-focussed Review Summarization [In Progress]


RESEARCH COLLABORATIONS


PROJECTS

  • Principal Investigator of a project on Legal Language Processing with a US-based Legal Corporation
  • Co-investigator on Argumentation Mining and Scholarly Discourse Processing with Research Assistant X (RAx)
  • Principal Investigator of a project on Peer Reviews

COMMUNITY SERVICE/ORGANIZATIONAL ROLES


HONORS AND AWARDS

  • Awarded the University First Rank medal for securing the highest marks in BCA examinations under University of North Bengal (Session: 2006-09). The medal was awarded in April, 2010.
  • Awarded the University First Class First Gold medal for securing the highest marks in MCA examinations under University of North Bengal (Session: 2009-12). Awarded on 26th February, 2015.
  • Awarded the Jindal Jubilee Gold medal for securing the highest marks in MCA under University of North Bengal (Session: 2009-12). Awarded on 26th February, 2015.
  • Awarded the Late Sourav Bhattacharya Memorial medal for securing the highest marks in MCA under University of North Bengal (Session: 2009-12). Awarded on 26th February, 2015.
  • Qualified UGC-NET, WB-SET, WB-TET for Assistant Professorship in universities/colleges in India
  • Awarded the high-value Visvesvaraya fellowship (MEITY) to pursue a Ph.D. at IIT Patna in January 2016 (awarded to only 500 Computer Science candidates to date)
  • Invited to present research on Novelty Detection at IRISS 2019, ACM India Annual Event held at Kochi, India from February 6-8, 2019 with funding from ACM India Chapter (Top 15 Computer Science PhDs).
  • Awarded the Augmenting Writing Skills for Articulating Research (AWSAR) 2018 award for popular science writing on Artificial Intelligence in Peer Review by Department of Science and Technology (DST), Government of India
  • Awarded Best Poster at the FORCE11 2019 conference at the University of Edinburgh, UK for the work “Novelty, Scope, Quality: Exploring Artificial Intelligence for Scholarly Communications”
  • Awarded the Best Student Research Paper at the SIG/MET METRICS ASIS&T Workshop 2020
  • Invited to participate in the European NLP Summit at Facebook Headquarters, London, UK on October 11, 2019
  • Selected as a visiting researcher at the Harvard-Smithsonian Institute, Harvard University, US in a NASA (Astrophysics Data System) project
  • Selected as a visiting researcher via the ORISE program at the Oak Ridge National Laboratory, US (March 2019-August 2019)
  • Awarded Best Research Poster at the Research Scholar’s Day (RSD) 2020 at IIT Patna
  • Invited to the FORCE11 2019 conference, held at the University of Edinburgh, UK from October 15-17, 2019, to present on AI for Peer Review with full sponsorship.
  • Invited to the FORCE11 2018 conference, held at McGill University, Montreal from October 10-11, 2018, to present on AI in Scholarly Communications with full sponsorship awarded by CrossRef
  • Selected for the MHRD fellowship to pursue a Ph.D. at IIT Patna in January 2016
  • Selected for best research presentation from CSE Department at Research Scholars Day, IIT Patna
  • Selected for ACM-SIGCHI Student Support Travel Grant to attend and present current work on recommendation systems for academic journals (if accepted) in ACM RecSys 2018 held in Vancouver, Canada from October 2-7, 2018
  • Awarded the ACM SIGIR Student Travel Grant 2018 to attend and present in the 18th ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2018 held at Fort Worth, Texas, US from June 3-6, 2018
  • Awarded the Microsoft Research Travel Grant 2018 to attend and present in the 27th International Conference on Computational Linguistics (COLING 2018) held at Santa Fe, New-Mexico, USA from August 20-26, 2018
  • Awarded the Visvesvaraya Conference Grant to attend and present in COLING 2018
  • Awarded the Microsoft Research Travel Grant 2019 to attend and present in the 37th IEEE International Joint Conference on Neural Networks (IJCNN) 2019 held in Budapest, Hungary from July 14-20, 2019
  • Awarded the ACM SIGIR Student Travel Grant 2019 to attend the 19th ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2019 held at the University of Illinois Urbana Champaign, US from June 2-6, 2019
  • Awarded the ACM-India IARCS Travel Grant 2019 to attend the 19th ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2019 held at the University of Illinois Urbana Champaign, US from June 2-6, 2019
  • Selected to represent the IIT Patna contingent in the Students Academic Conference at Inter IIT Tech Meet 8.0 at IIT Roorkee
  • Selected to represent the Department of Computer Science and Engineering at the Research Scholar Day in 2017 and 2020, IIT Patna.
  • Selected for the Spring Mentorship Cohort 2020 of the Society for Scholarly Publishing (SSP)
  • Won Third Prize, as part of a team, in the AI Debater Challenge on the Argument Pair Extraction task at NLPCC 2021

CONFERENCE VISITS/PRESENTATIONS

  • Attended and presented at the 18th ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2018 held at Fort Worth, Texas, US from June 3-6, 2018
  • Attended and presented at the 27th International Conference on Computational Linguistics (COLING) 2018 held at Santa Fe, New-Mexico, USA from August 20-26, 2018
  • Attended the Fourth Research Workshop for the “Visvesvaraya PhD Scheme for Electronics & IT” at MNIT Jaipur during 13th-15th September, 2018.
  • Attended and presented at the Future of Research Communication and e-Scholarship (FORCE) conference 2018 held at McGill University, Montreal, Canada from October 10-11, 2018
  • Presented AI for Scholarly Communications at the Text Mining workshop organized by the Computational Data Analytics group at the Oak Ridge National Laboratory in November 2018
  • Attended and presented at the 13th Inter Research Institute Student Seminar in Computer Science 2019, ACM India Annual Event held at Kochi, India from February 6-8, 2019
  • Attended and presented at the 19th ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2019 held at the University of Illinois Urbana-Champaign, US from June 2-5, 2019
  • Attended and presented at the Artificial Intelligence (AI) Expo at the Oak Ridge National Laboratory, US on July 29, 2019
  • Attended and presented at the 7th Annual Oak Ridge Postdoctoral Association Research Symposium at the Oak Ridge National Laboratory, US on August 6, 2019
  • Participated in the European NLP Summit at Facebook Headquarters, London, UK on October 11, 2019
  • Attended and presented at the Future of Research Communication and e-Scholarship (FORCE) conference 2019 held at the University of Edinburgh, Scotland, UK from October 15-17, 2019
  • Delivered an Elsevier Lab Online Lecture on Novelty Detection held online on October 6, 2020
  • Presented at the ASIS&T SIG/MET workshop on October 22-23, 2020 on Computational Analysis of Peer Reviews and Establishing a Research Lineage
  • Presented at West Coast NLP 2020 on October 30, 2020 on Textual Novelty Detection.
  • Presented at ICADL 2020 on November 30 on Scope Detection in Scientific Publications.
  • Delivered a tutorial on Natural Language Processing at Manipal Academy of Higher Education on December 17, 2020

REFERENCES

  1. Asif Ekbal, Associate Professor, Indian Institute of Technology Patna
  2. Valia Kordoni, Associate Professor, Humboldt University Berlin
  3. Robert Patton, Learning Systems Group Lead, Oak Ridge National Laboratory, US
  4. John Chodacki, Director, University of California Curation Center, US
  5. Ondrej Bojar, Institute of Formal And Applied Linguistics, Charles University, Prague