We have two new papers accepted to the LREC 2022 conference that takes place in Marseille the upcoming June!
The first paper is authored by Thomas Vakili, Aron Henriksson, Hercules Dalianis, and me and is called “Downstream Task Performance of BERT Models Pre-Trained Using Automatically De-Identified Clinical Data” with code 412. It evaluates the performance of a language model that is trained using De-identified clinical text in later tasks, and explores the de-identification impact in the development of the language model.
The second paper is authored by Aron Henriksson, Hercules Dalianis, and me and is called “Evaluating Pre-training Strategies for Clinical BERT Models” with code 661. It empirically compares different pre-training strategies for the development of domain-adapted language models in the Swedish clinical text domain.
You can find all the accepted papers including the ones mentioned above here!
The application submitted in this year to Swedish Research School of Management and Information Technology (MIT) for Ph.D. funding for research in IT management and governance at DSV/Stockholm University has been successfully. In the next five years DSV will receive 1.750.000 SEK from MIT for co-financing a Ph.D. position in IT management and governance. For more information about Swedish Research School of Management and Information Technology (MIT) please access the following link: http://www.mit.uu.se/
Professor Hercules Dalianis and I got a paper about the privacy preserving qualities of BERT accepted to the AAAI Fall Symposium on Human Partnership with Medical Artificial Intelligence! The paper is titled Are Clinical BERT Models Privacy Preserving? The Difficulty of Extracting Patient-Condition Associations. Our results strongly suggest that BERT’s poor generative capabilities makes it resistant to training data extraction attacks. Other models, such as GPT-2, have been shown to be susceptible to these attacks. From a privacy perspective, being a poor generator may be a feature!
Later in the same week, I flew from Stockholm to Punta Cana in the Dominican Republic to participate at EMNLP 20211. Almost 500 participants were there, with the total number of participants exceeding 4,000. There were many interesting presentations regarding NLP in general, but also some that were specifically about the privacy aspects of NLP. It was a great experience to learn where the field is headed and also to get to know many talented researchers. I have written a summary of some of the interesting papers – reach out if you are interested in it.
1: EMNLP stands for Empirical Methods in Natural Language Processing
My name is Yash. I recently started working at DSV as a Research Assistant with Aron Henriksson. I will be working on a project that is aimed at developing Machine Learning models for the outcome prediction of COVID-19 hospitalizations using structured and text data. I will also be working as a Teaching Assistant for the courses BIG DATA and NLP.
I recently completed my Masters degree in Statistics and Machine Learning from Linköping University and hold a Bachelors in Electronics and Telecommunications from the College of Engineering Pune, India.
I am glad to be here and I look forward to meeting you all! 🙂
Professor Hercules Dalianis, Sonja Remmer and myself represented DSV at the First ClinCode Conference. The conference gathered experts in medicine and computer science from across the Nordics and took place at the University Hospital of North Norway (UNN) in Tromsø.
The conference was chaired by Hercules, who is also a guest professor at the Norwegian Centre for E-health Research. Sonja shared her work on automatic ICD-10 classification using BERT and I spoke about the difficulty of extracting training data from clinical BERT models.
Several participants had an industry or medical background. This provided valuable insights into how our research at DSV may be used in practice and what challenges are most important. It also highlighted the great potential that can be unlocked by continuing to investigate ICD-10 classification and other medical NLP problems.
Many excellent ideas were hatched in the discussions, and it was lovely to visit the beautiful polar city of Tromsø. Personally, I really look forward to future iterations of the conference!
I am very happy and proud that Association for Information Systems (AIS) member program has recognized my activity to have a significant impact on the association and made me a Distinguished Member of AIS (https://aisnet.org/page/DistinguishedMemberList).
The results of the research work done by Rahmat Mulyana, Lazar Rusu and Erik Perjons (all members of research group in IT Management and Governance at DSV) in IT Governance and Digital Transformation areas has raised the interest of the worldwide IS community, like is the the Americas Conference on Information Systems (AMCIS) conference, that is a top conference in IS area. Their paper entitled IT Governance Mechanisms Influence on Digital Transformation: A Systematic Literature Review published in AMCIS 2021 Proceedings is mentioned among the Most Popular Papers at AMCIS 2021 (https://aisel.aisnet.org/amcis2021/topdownloads.html).