<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.10.0">Jekyll</generator><link href="https://log.lab.matkelly.com/feed.xml" rel="self" type="application/atom+xml" /><link href="https://log.lab.matkelly.com/" rel="alternate" type="text/html" /><updated>2026-04-03T21:12:32+00:00</updated><id>https://log.lab.matkelly.com/feed.xml</id><title type="html">MAT Lab Lab Log</title><subtitle>Writings and Reports from Mat Kelly&apos;s Research Group at Drexel CCI</subtitle><author><name>Mat Kelly</name></author><entry><title type="html">Append the LM to the IR</title><link href="https://log.lab.matkelly.com/info624-2026" rel="alternate" type="text/html" title="Append the LM to the IR" /><published>2026-04-03T00:00:00+00:00</published><updated>2026-04-03T00:00:00+00:00</updated><id>https://log.lab.matkelly.com/info624-2026</id><content type="html" xml:base="https://log.lab.matkelly.com/info624-2026"><![CDATA[<p>From January to March 2026, I taught <span style="color: red;">INFO624: Intelligent Search and Language Models</span> at Drexel CCI—a course that sits at the intersection of classical information retrieval (IR) and modern AI-driven language models.
<!-- endexcerpt --></p>

<p>This offering marked a deliberate shift from previous iterations of INFO624. While earlier versions focused on traditional IR systems, this course expanded to explore how language models are reshaping the way we search, rank, and interact with information. In many ways, the guiding question became: what does it mean to <em>append the LM to the IR</em>?</p>

<p>The course was delivered in a cross-listed format, with a mix of in-person and asynchronous students. Preparing and teaching it required not just updating materials, but continuously adapting to a rapidly evolving technical landscape—one where best practices can shift within months.</p>

<h1 id="topics">Topics</h1>

<p>Despite losing two instructional days (MLK Day and a late-January snowstorm), the course covered eight weeks of material spanning both foundational and emerging topics:</p>

<ul>
<li>Introduction to IR and AI foundations</li>
<li>Text Processing and AI-enchanced pre-processing</li>
<li>From Vector Space Models to Dense Representations</li>
<li>Probablistic Models and Neural Language Models for IR</li>
<li>AI-Driven Web Search and Retrieval Techniques</li>
<li>Graph Analysis and Neural Linking Models</li>
<li>Evaliation Metrics and AI-Enhanced IR Systems</li>
<li>Relevance Feedback with AI Techniques</li>
<li>Clustering and Classification with Deep Learning</li>
<li>Emerging Topics in AI (e.g., RAG, XAI, Multimodal IR)</li>
</ul>

<p>Each topic could easily warrant a full course on its own, but the goal here was breadth with meaningful depth—enough to ground students before they explored ideas in their projects.</p>

<h1 id="student-projects">Student Projects</h1>

<p>The course enrolled 20 students, who could choose to work individually or in groups. Projects took one of two forms: (1) an IR/AI-focused literature review or (2) the design and evaluation of a working system. In total, 12 projects were submitted, reflecting a wide range of interests across modern information retrieval and language model integration.</p>

<p><strong>Systems</strong></p>

<p>Omkar, Manjiri, and Priti developed a multi-source search system that retrieves, synthesizes, and self-evaluates information from web, academic, and local data to generate comprehensive, cited answers.<br />
• <a href="https://github.com/Priti0427/Intelligent-Search-agent">https://github.com/Priti0427/Intelligent-Search-agent</a></p>

<p>Mokshad and Ishant built a search engine over arXiv papers that combines BM25 with BERT-based retrieval, while providing transparent explanations for ranking decisions.<br />
• <a href="https://github.com/Mokshu3242/arXiv-Paper-Search-System">https://github.com/Mokshu3242/arXiv-Paper-Search-System</a></p>

<p>Ian built a two-stage recipe search engine on the Food.com corpus (~230K recipes, 1.1M reviews), integrating BM25 retrieval, rule-based query alignment, and neural embeddings derived from review-based quality signals.<br />
• <a href="https://github.com/iauger/recipe-search-engine">https://github.com/iauger/recipe-search-engine</a></p>

<p>Chinomso designed a system for question answering over PDFs that incorporates document structure (sections and hierarchy) into both retrieval and grounded generation.<br />
• <a href="https://github.com/MishaelTech/explanable_structured_rag_pdf">https://github.com/MishaelTech/explanable_structured_rag_pdf</a></p>

<p>Charles implemented a transparent full-text search engine over newly released JFK assassination documents, enabling precise and citable exploration of primary historical sources.</p>

<p>Robert and Ayush created a system that combines chapter-level character summaries with semantic retrieval to support exploration and querying of long-form narrative texts.</p>

<p>Jake developed a prototype system using FAISS, augmented with salience and recency signals, to retrieve narrative memories for consistent storytelling in AI-driven environments.</p>

<p>Mason built a RAG-based search engine for personal finance, retrieving and summarizing trusted financial documents to answer user questions in natural language.<br />
• <a href="https://github.com/riccimason99/Financial-Planning-Search-Engine">https://github.com/riccimason99/Financial-Planning-Search-Engine</a></p>

<p><strong>Literature Reviews</strong></p>

<p>Sriram, Sourav, Khushi, and Lohitha conducted a survey of retrieval-augmented generation (RAG) methods for academic use, focusing on hybrid retrieval, self-reflection, and challenges such as faithfulness and evaluation.</p>

<p>Muhammad analyzed the evolution of neural information retrieval, tracing the progression from early embeddings to modern transformer-based dense retrieval and identifying remaining challenges.</p>

<p>Sriram examined personalization in search, exploring how systems balance relevance with novelty and diversity under ambiguous or evolving user intent.</p>

<p>Grace compared thesauri, knowledge graphs, and latent semantic analysis as methods for incorporating semantic relationships into retrieval systems.</p>

<h1 id="conclusion">Conclusion</h1>

<p>Overall, INFO624 highlighted just how quickly information retrieval and language models are converging—both in research and in practice. What once felt like separate paradigms are now deeply intertwined, with modern systems blending classical ranking methods and neural representations into hybrid approaches.</p>

<p>The range of student projects reflects this shift clearly: systems emphasized not only performance, but also transparency, evaluation, and real-world usability. Just as importantly, many projects grappled with emerging challenges such as faithfulness, explainability, and the limits of current models.</p>

<p>For me, teaching this course reinforced an important reality: working in this space requires constant adaptation. The tools, techniques, and expectations are evolving rapidly, and education must evolve with them. If anything, this iteration of INFO624 felt less like a static course and more like a snapshot of a moving target—one that students are now well-equipped to continue exploring.</p>]]></content><author><name>Mat Kelly</name></author><category term="teaching" /><category term="info624" /><category term="2026" /><category term="information" /><category term="retrieval" /><category term="language" /><category term="models" /><category term="ai" /><category term="drexel" /><category term="cci" /><category term="students" /><category term="projects" /><summary type="html"><![CDATA[Report on INFO624 Intelligent Search and Language Models at Drexel CCI in Winter 2026 with course topics, student projects, and lessons learned.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://log.lab.matkelly.com/assets/2026-04-03/twitter-card.png" /><media:content medium="image" url="https://log.lab.matkelly.com/assets/2026-04-03/twitter-card.png" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Lessons in Reteaching Web Programming</title><link href="https://log.lab.matkelly.com/info655-2025" rel="alternate" type="text/html" title="Lessons in Reteaching Web Programming" /><published>2025-03-29T00:00:00+00:00</published><updated>2025-03-29T00:00:00+00:00</updated><id>https://log.lab.matkelly.com/info655-2025</id><content type="html" xml:base="https://log.lab.matkelly.com/info655-2025"><![CDATA[<p>From January to March (Winter Quarter) 2025, I instructed the course <span style="color: red;">INFO655: Intro to Web  Programming</span> at <a href="https://drexel.edu/cci/">Drexel CCI</a>. 
<!-- endexcerpt --></p>

<p><img src="/assets/2025-03-29/react.png" alt="React Logo &gt;" class="logo" /></p>

<p>This was the second iteration of a course I revamped at the college after it had not been offered for over a decade and its previous incarnation using Java in the course description. The mutual decision between me and <a href="https://lincs.cci.drexel.edu/weimao/">Dr. Weimao Ke</a> (who teaches the follow-on course, INFO670: Cross-Platform Mobile Development) was to use the <a href="https://react.dev/">React Framework</a> due to its established community, support, documentation, etc. rather than adopting a more modern framework with the expectation that React fundamentals could be extrapolated to these others by the students in the future if interested.</p>

<p>As I have not written much about teaching here, I am using this waypoint to enumerate the topics we covered, the lessons learned in the second offering of the course, highlight students’ projects, and enumerate improvements I plan to make for next year’s offerings.</p>

<h1 id="topics">Topics</h1>

<p>The 2025 edition of the course had 16 graduate students, split between two sections: 6 in the face-to-face section and 9 in an online section. All students were delivered the same material with the expectation that the face-to-face students would attend a synchronous lecture and interactive session while the online students were permitted to participate asynchronously over the 10-week period. The students came from a variety of Master’s degree-seeking programs at Drexel ranging from <a href="https://drexel.edu/cci/academics/masters-programs/ms-in-information-human-computer-interaction-ux/">HCI</a> to <a href="https://drexel.edu/engineering/academics/graduate-programs/masters/robotics-autonomy/">Robots and Automation</a> with the largest majority coming from the <a href="https://drexel.edu/cci/academics/masters-programs/ms-in-information-systems/">Information Systems</a> and <a href="https://drexel.edu/cci/academics/masters-programs/ms-in-computer-science/">Computer Science</a> majors. This variety of background necessitated a web programming primer for the first week of class to (potentially re-)introduce HTML, CSS, and JavaScript and highlight React’s place in both building up and straddling these technologies with its state management and <a href="https://react.dev/learn/writing-markup-with-jsx">JSX</a>-based rendering. In the following weeks we took a deeper dive into React (<span style="color: red;">Week 2</span>), learned about conditional rendering and server basics (<span style="color: red;">Week 3</span>), explored the dynamics and practice of passing props (<span style="color: red;">Week 4</span>), and learned how to write test cases for components (<span style="color: red;">Week 5</span>).</p>

<p>Students were given a midterm exam in <span style="color: red;">Week 6</span> as well as a session of optimizing performance in their component hierarchies. In <span style="color: red;">Week 7</span> we discussed deployment and in <span style="color: red;">Week 8</span> GraphQL. For the last lecture session (<span style="color: red;">Week 9</span>) of the term, we went over Authentication and some advanced topics to prepare interested students for the follow-on course (INFO670). Throughout the term, students progressively worked on a progressively built series of assignments that had them build an audio playlist (without the need to integrate actual audio) to get hands-on experience with the technologies and approach they were learning about. Also throughout the term, students were required to give project checkins, essentially milestones, for their course project that they individually built and presented in <span style="color: red;">Week 10</span>. Online students were given the option to post their presentations to the Blackboard discussion board and other students were encouraged to give feedback to their peers’ projects with the expectation that other students could comment and their their peers’ projects as well.</p>

<h1 id="student-projects">Student Projects</h1>

<p>During the first week of the term, students were required to declare the course project they hoped to build throghout the term. While they were permitted to change the project if deemed unfeasible, most students stuck with it and were able to implement new concepts as we discussed them in class. All students were required to post their code and (ideally) instructions to run their React application onto GitHub. I would like to highlight some of the projects here.</p>

<p><img src="/assets/2025-03-29/project_nicole.png" class="floatRightScreenshot" /></p>

<p><span style="font-weight: bold;">Nicole</span> created an interface to build custom avatars, implementing the hair, eyes, mouth, etc. as reusable components.</p>

<ul>
<li>Source: <a href="https://github.com/ideknic/AvatarCreator">https://github.com/ideknic/AvatarCreator</a></li>
<li>Deployment: <a href="https://avatarcreator.surge.sh">https://avatarcreator.surge.sh</a></li>
</ul>

<p><img src="/assets/2025-03-29/project_athmeeya.png" class="floatRightScreenshot" /></p>

<p><span style="font-weight: bold;">Athmeeya</span> created a study planner that incorporated task management, notifications, and dark mode.</p>

<ul>
<li>Source: <a href="https://github.com/AthmeeyaM/INFO-655-project-Study-Planner-">https://github.com/AthmeeyaM/INFO-655-project-Study-Planner-</a></li>
<li>Deployment: <a href="https://athmeeyam.github.io/INFO-655-project-Study-Planner-/">https://athmeeyam.github.io/INFO-655-project-Study-Planner-/</a></li>
</ul>

<p><img src="/assets/2025-03-29/project_vanshika.png" class="floatRightScreenshot" /></p>

<p><span style="font-weight: bold;">Vanshika</span> built a personalized travel hub to allow users to document photos of their trip, budget expenses, and organize a packing list.</p>
<ul>
<li>Source: <a href="https://github.com/V-Security-beep/PERSONAL-TRAVEL_companion">https://github.com/V-Security-beep/PERSONAL-TRAVEL_companion</a></li>
<li>Deployment: <a href="https://v-security-beep.github.io/PERSONAL-TRAVEL_companion/">https://v-security-beep.github.io/PERSONAL-TRAVEL_companion/</a></li>
</ul>

<p><img src="/assets/2025-03-29/project_brenda.png" class="floatRightScreenshot" /></p>

<p><span style="font-weight: bold;">Brenda</span> built a fitness stopwatch that allows users to track timed sessions, splits, and  progress over time.</p>
<ul>
<li>Source: <a href="https://github.com/br534/BeginMotion">https://github.com/br534/BeginMotion</a></li>
</ul>

<p><br style="clear: both;" /></p>

<h1 id="future-improvements">Future Improvements</h1>

<p>The need to overhaul this course due to the usage of older technologies was felt even between the first and second offerings of this course. In December 2024, as I prepare to teach the course in January 2025, <a href="https://react.dev/blog/2024/12/05/react-19">React 19</a> was released, which streamlined some aspects of caching (e.g., <a href="https://react.dev/reference/react/memo">memoization</a> can be handled by <a href="https://react.dev/learn/react-compiler">React Compiler</a>), changed the preference of boilerplate generators (<a href="https://vite.dev/">Vite</a> over the now-deprecated <a href="https://create-react-app.dev/">create-react-app</a>), and provided full support for <a href="https://custom-elements-everywhere.com/">custom elements everywhere</a>.</p>

<p>On the other end, the 2024 edition of INFO655 had students use <a href="https://jestjs.io/">Jest</a> and <a href="https://testing-library.com/docs/react-testing-library/intro/">React’s Testing Library</a> for the preferred testing framework whereas this proved difficult for some students in React 19 due to different preferred frameworks. This aspect, unfortunately, was not incorporated into the 2025 course due to my late realization. It will be updated in the 2026 iteration.</p>

<p>Additionally, students expressed in their feedback that the number of checkins required was a bit too numerous. The original basis for them was to keep students on-track and to address any difficulties they were encountering in building their project. In the future, this feedback will be incorporated with fewer checkins more evenly distributed over the term.</p>

<h1 id="conclusion">Conclusion</h1>

<p>As a course of my own design, I have enjoyed getting the opportunity to polish the rough edges with two cohorts of students and know there is still much to be improved. In the future, at some point, Dr. Ke and I may move on from React as long as newer web programming paradigms can be implemented just as easily by students from diverse academic backgrounds.</p>]]></content><author><name>Mat Kelly</name></author><category term="teaching" /><category term="info655" /><category term="2025" /><category term="winter" /><category term="react" /><category term="web" /><category term="programming" /><category term="drexel" /><category term="cci" /><category term="hci" /><category term="robotics" /><category term="students" /><category term="projects" /><summary type="html"><![CDATA[Report on INFO655 Intro to Web Programming at Drexel CCI in Winter 2025 with course topics, student projects, and lessons learned.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://log.lab.matkelly.com/assets/2025-03-29/twitter-card2.jpg" /><media:content medium="image" url="https://log.lab.matkelly.com/assets/2025-03-29/twitter-card2.jpg" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Preserving Personalized Advertisements for More Accurate Web Archives</title><link href="https://log.lab.matkelly.com/imls-webads-2024" rel="alternate" type="text/html" title="Preserving Personalized Advertisements for More Accurate Web Archives" /><published>2024-07-31T00:00:00+00:00</published><updated>2024-07-31T00:00:00+00:00</updated><id>https://log.lab.matkelly.com/imls-webads-2024</id><content type="html" xml:base="https://log.lab.matkelly.com/imls-webads-2024"><![CDATA[<p><img src="/assets/2024-07-31/mask-ad-not-covid-article.png" alt="COVID Web Ad &gt;" width="300" class="centered fullwidthmobile" /></p>

<p>Advertisements on the web are annoying to experience yet when preserved, are useful for analyzing our digital pasts. In collaboration with <a href="https://oduwsdl.github.io/">Old Dominion University’s Web Science and Digital Libraries (WS-DL) Research Group</a> (<a href="https://x.com/webscidl">@WebSciDL</a>), we at <a href="https://drexel.edu/cci/">Drexel CCI</a> (<a href="https://x.com/DrexelCCI">@DrexelCCI</a>) are pleased to announce that the follow-on to our “<a href="https://www.imls.gov/grants/awarded/lg-252362-ols-22">Saving Ads…</a>” (see <a href="/imls-webads">blog post</a>) project, “<a href="https://www.imls.gov/grants/awarded/lg-256695-ols-24">Preserving Personalized Advertisements for More Accurate Web Archives</a>”, has been selected for funding from <a href="https://www.imls.gov/">IMLS</a> starting in August 2024!</p>

<p>The project will run for 2 years and focus on preserving personalized advertisements on the web—the ones that are tailored to individuals or specific demographics—with the realization that preserving dynamic, much less personalized web content, is difficult and frequently not performed.</p>

<p><img src="/assets/2024-07-31/scheduleOfCompletion.png" alt="Schedule of Completion &gt;" width="300" class="centered fullwidth fullwidthmobile" /></p>

<p><img src="/assets/logos/imls-logo.png" alt="IMLS Logo &gt;" width="300" class="centered fullwidthmobile" /></p>

<p>In this project spanning from August 2024 to the end of July 2026, we will explore research questions like:</p>

<ol>
<li>To what extent do institutional web archives capture personalized advertisements on the web?</li>
<li>Do scholarly and lay users prefer (re)using archived web pages that include personalized ads, that include a generic comprehensive capture, or that do not include web pages with ads?</li>
<li>How might the strategic use of diverse personas illuminate web content that would otherwise go unarchived?</li>
</ol>

<p>Our continued focus on diversity will further resonate with this project as we seek to preserve more accurate representations of past web pages than what you might have seen as captured from a web archive’s crawler. This project will consist of four phases including tasks for evaluation that will allow us to circle back to prior phases with further informed exploratory processes after the initial investigations. Each of the respective phases will focus on (1) Persona development and data collection, (2) Evaluation of data relative to web archives’ holdings, (3) Evaluation of captures based on users’ expectations, and (4) Results dissemination and technical web archive supplementation.</p>

<figure>
  <img src="https://log.lab.matkelly.com/assets/2024-07-31/location_based_ad_1_during_crawl_time.png" alt="Location-based ad" style="width: 50%; float: left;" />
  <img src="https://log.lab.matkelly.com/assets/2024-07-31/personalized_ad_GPU_AI.png" alt="Personalized Ad" style="width: 50%;" />
  <figcaption>Two examples of personalized ads: one is location-based (a 2024 concert in Virginia for a CNN.com page from 2013) and one is based on browsing history (a current NVIDIA graphics card for a 2014 article).</figcaption>
</figure>

<p><img src="/assets/logos/saa-logo.png" alt="SAA Logo &gt;" width="300" class="mug" />
<img src="/assets/logos/ipres2024-logo.png" alt="iPRES 2024 Logo &gt;" width="300" class="mug" /></p>

<p>Some initial findings from the preliminary planning grant of this project, so far, have been presented at the <a href="https://www2.archivists.org/am2024/research-forum-2024">2024 Society of American Archivists (SAA) Research Forum</a> (<a href="https://matkelly.com/SAA2024">slides</a>) and will be presented at the upcoming <a href="https://ipres2024.pubpub.org/">2024 International Conference on Digital Preservation (iPRES)</a> this September. Further, additional technical details on our initial exploration are being finalized for dissemination as we bridge our IMLS Planning and upcoming Applied Research Grant.
<!--
![Drexel CCI Logo <](/assets/logos/drexel-cci.png){: width="10" .logo .logosmol}
![ODU WS-DL Logo >](/assets/logos/odu-wsdl.png){: width="100" .logo .logosmol}
--></p>

<p>Additional information about the project is <a href="https://www.imls.gov/grants/awarded/lg-256695-ols-24">available directly from IMLS</a> with specific details, use cases, planning, progress, and other details to come as the project progresses. Thank you again, IMLS, for the continued support of the “Web Ads” project by Drexel CCI and ODU WS-DL!</p>

<!--
This project will be coordinated by investigators at Drexel <a href="https://matkelly.com">Mat Kelly</a> (<a href="https://x.com/machawk1">@machawk1</a> and <a href="https://drexel.edu/cci/about/directory/P/Poole-Alex/">Alex H. Poole</a>) and ODU (<a href="https://www.cs.odu.edu/~mln">Michael L. Nelson</a> (<a href="https://x.com/phonedude_mln">@phonedude_mln</a>) and <a href="https://weiglemc.github.io/">Michele C. Weigle</a> (<a href="https://digipres.club/@weiglemc">@weiglemc@digipres.club</a>).-->]]></content><author><name>Mat Kelly</name></author><category term="funding" /><category term="imls" /><category term="webads" /><category term="wsdl" /><category term="michael-nelson" /><category term="michele-weigle" /><category term="alex-poole" /><category term="personalization" /><category term="saa" /><category term="ipres" /><summary type="html"><![CDATA[Our IMLS NLG-L grant on assessing and improving web archives' holdings of online advertisements has been selected for funding!]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://log.lab.matkelly.com/assets/2024-07-31/twitter-card.png?202407311018" /><media:content medium="image" url="https://log.lab.matkelly.com/assets/2024-07-31/twitter-card.png?202407311018" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">CCI DSA Research Showcase 2024</title><link href="https://log.lab.matkelly.com/cci-dsa-showcase-2024" rel="alternate" type="text/html" title="CCI DSA Research Showcase 2024" /><published>2024-05-29T00:00:00+00:00</published><updated>2024-05-29T00:00:00+00:00</updated><id>https://log.lab.matkelly.com/cci-dsa-showcase-2024</id><content type="html" xml:base="https://log.lab.matkelly.com/cci-dsa-showcase-2024"><![CDATA[<blockquote>
"Never miss an opportunity to present your research."
</blockquote>

<p>I was advised of this (by <a href="https://x.com/phonedude_mln">@phonedude_mln</a>) while a PhD student. On May 28, 2024, the <a href="https://drexel.edu/cci/">Drexel CCI</a> <a href="https://drexel.edu/cci/current-students/doctoral-students/cci-doctoral-student-association/">Doctoral Student Association (DSA)</a> held the First Annual Research Showcase, which invited students, faculty, and others in the college to present and witness ongoing research by any students or faculty that opted to present.</p>

<p>I opted to present.</p>

<p>I have reiterated the advice that started this post to my PhD student, <a href="https://choihywook.github.io/">Hyung Wook Choi</a>, who also opted to present.</p>

<p>This post is a summary of this brief event.</p>

<h1 id="the-event">The Event</h1>

<p><img src="/assets/2024-05-29/dsa-logo.png" alt="DSA logo &gt;" width="300" class="centered" /></p>

<p>The event commenced at 4 pm on the 10th floor of Drexel CCI in Philadelphia. The event was composed of about a dozen student posters and a series of 8 5-minute lightning talks at 5 pm following the poster session.</p>

<h2 id="poster-session">Poster Session</h2>

<p><img src="/assets/2024-05-29/2024-05-29-food.jpg" alt="Refreshments at the DSA Research Showcase &gt;" width="200" /></p>

<p>Attendees were able to mingle and discuss students’ research presented on posters during the first hour, while enjoying refreshments. I visited several posters and spoke with the students to understand their research explorations.</p>

<p><span class="person"><a href="https://www.laylab.me/">Layla Bouzoubaa</a></span> (<a href="https://x.com/Bouzoulay">@Bouzoulay</a>) presented, “Stigma: Classified Contextualized Encounters”, a work-in-progress project that looks into the notion of stigma concerning drug use on online forums like various subreddits.</p>

<!--Kshitij Kayastha-->

<!--Manil Shrestha-->

<p><span class="person"><a href="https://www.linkedin.com/in/steve-earth-70988683/">Steve Earth</a></span> provided a poster correlating the ability of students to write proofs with prior experience in math and programming courses.</p>

<p><span class="person"><a href="https://smpascua.com/">Sonia Pascua</a></span> (<a href="https://x.com/sony_d_gr8t">@sony_d_gr8t</a>) showed her work that explores <a href="https://lincs.cci.drexel.edu/weimao/">Dr. Weimao Ke</a>’s (<a href="https://x.com/keweimao">@keweimao</a>) DLITE formulation model for information representation and knowledge organization.</p>

<p><span class="person"><a href="https://www.katiezellner.com/">Katie Zellner</a></span> (<a href="https://x.com/katzellnerd">@katzellnerd</a>) presented her work with <a href="https://cci.drexel.edu/faculty/asarcevic/">Dr. Aleksandra Sarcevic</a> on the effectiveness of physical user interfaces in simulated medical settings.</p>

<p><img src="/assets/2024-05-29/2024-05-29-aria.jpg" alt="Aria presenting his poster&gt;" width="200" class="fullwidthmobile" /></p>

<p><span class="person">Aria Pessianzadeh</span> (<a href="https://x.com/apessianIU">@apessianIU</a>) presented his work investigating a more refined stance model for controversial topics like affirmative action as reflected in Reddit comments. He anticipated his model to be generally applicable to other controversial topics discussed in online discourse.</p>

<p><img src="/assets/2024-05-29/2024-05-29-lu.jpg" alt="Lu presenting her poster&gt;" width="200" class="fullwidthmobile" /></p>

<p><span class="person">Lu Wang</span>’s poster described her research in detecting biases in large language models (LLMs), how to measure them, and how to mitigate them.</p>

<p><span class="person"><a href="https://halflingwizard.github.io/">Matt Namvarpour</a></span> showed his work focusing on the roles that technology companies play in children’s online safety.</p>

<!--David Breen-->
<p><br style="clear: both;" />
<img src="/assets/2024-05-29/2024-05-29-abass.jpg" alt="Abass presenting his poster &gt;" width="200" class="fullwidthmobile" />
<span class="person">Abass Ahmed</span> (<a href="https://x.com/unbent_">@unbent_</a>) presented his poster exploring the current state of immigrant support in the Shenandoah Valley region of Virginia.</p>

<p><br /></p>

<h2 id="lightning-talks">Lightning Talks</h2>

<p>Following the poster session in the first hour of the event were a series of 5-minute lightning talks.</p>

<p><img src="/assets/2024-05-29/2024-05-29-steve.jpg" alt="Steve &gt;" width="250" class="fullwidthmobile" /></p>

<p><span class="person">Steve Earth</span> (who also presented a poster) further elaborated on his exploration of skill transfer between programming education and proofwriting proficiency.</p>

<p><img src="/assets/2024-05-29/2024-05-29-wook.jpg" alt="Wook &gt;" width="250" class="fullwidthmobile" /></p>

<p>My PhD student, <span class="person"><a href="https://choihywook.github.io/about/">Hyung Wook Choi</a></span>, described her research exploring the semantic evolution of terms between domains and prior approaches that have attempted to solve similar problems.</p>

<p><img src="/assets/2024-05-29/2024-05-29-ed.jpg" alt="Ed &gt;" width="250" class="fullwidthmobile" /></p>

<p><span class="person"><a href="https://edwardkim.net/">Edward Kim</a></span> (<a href="https://x.com/edk208">@edk208</a>) described his work on adding a “prefrontal cortex” to mitigate harms, essentially (sic) lobotomizing them.</p>

<p><img src="/assets/2024-05-29/2024-05-29-shadi.jpg" alt="Shadi &gt;" width="250" class="fullwidthmobile" /></p>

<p><span class="person"><a href="https://www.shadirezapour.com/">Shadi Rezapour</a></span> (<a href="https://x.com/shadi_rezapour">@shadi_rezapour</a>) talked about her work on integrating methods of exploring online interaction to enhance socially aware models.</p>

<p><img src="/assets/2024-05-29/2024-05-29-mat.jpg" alt="Shadi &gt;" width="250" class="fullwidthmobile" /></p>

<p>I (<span class="person">Mat Kelly</span>, <a href="https://x.com/machawk1">@machawk1</a>) presented a summary of my research on complex information retrieval tasks for web archives in dimensions beyond time (<a href="/assets/2024-05-29/slides.pdf#toolbar=0&amp;navpanes=0&amp;scrollbar=0">slides available below</a>).</p>

<p><br style="clear: both;" /></p>
<figure style="margin: auto; text-align: center; display: flex; justify-content: center;" class="hoverfig">
<a href="https://matkelly.com/dsa2024" target="_blank" style="display: block; width: 75%;"><img src="/assets/2024-05-29/first-slide.png" class="fullwidth fullwidthmobile" style="margin: auto; display: block;" /><figcaption style="margin-top: 0;">Slides for Mat Kelly's presentation,<br />"To Request Is Human, To Retrieve Divine"</figcaption></a>

</figure>

<!--<iframe src="/assets/2024-05-29/slides.pdf#toolbar=0&navpanes=0&scrollbar=0" style="width: 100%; border: 0; height: 500px;" />-->

<p><img src="/assets/2024-05-29/2024-05-29-afsaneh.jpg" alt="Shadi &gt;" width="250" class="fullwidthmobile" /></p>

<p><span class="person"><a href="https://www.afsanehrazi.com/">Afsaneh Razi</a></span> (<a href="https://x.com/Afsaneh_Razi">@Afsaneh_Razi</a>) talked about her recently presented papers at <a href="https://chi2024.acm.org/">CHI’24</a> discussing how teens perceive personalized content online and whether they preferred the advice of trained counselors for dire situations over the advice of AI-generated responses.</p>

<p><img src="/assets/2024-05-29/2024-05-29-sonia.jpg" alt="Sonia Pascua &gt;" width="250" class="fullwidthmobile" /></p>

<p><span class="person">Sonia Pascua</span> (who also had a poster) went into further detail about her dissertation research on the DLITE methods for information retrieval, particularly on how it compared to other metrics.</p>

<h1 id="closing-and-acknowledgement">Closing and Acknowledgement</h1>

<p>After Sonia’s presentation and a slight pause to socialize, awards were presented to the three best posters, as evaluated by a panel of three judges.</p>

<p><img src="/assets/2024-05-29/2024-05-29-awards.jpg" alt="Awards&gt;" class="fullwidth fullwidthmobile" /></p>

<p><br style="clear: both;" /></p>

<p><img src="/assets/2024-05-29/2024-05-29-layla.jpg" alt="Steve &gt;" width="250" class="fullwidthmobile" /></p>

<p>Overall, the First Annual Research Showcase put on by Drexel CCI’s Doctoral Student Association was a great chance to hear about students’ and faculty’s ongoing research as well as to get together in an informal setting to socialize with our CCI peers. I look forward to the DSA putting on this event again in 2025. A special thanks goes to Layla Bouzoubaa for organizing the event.</p>

<hr />

<p>EDIT: Also, a special thanks to <span class="person">John Kunze</span> (<a href="https://x.com/jakkbl">@jakkbl</a>) for reporting some typos in this post after publication. They have since been corrected.</p>]]></content><author><name>Mat Kelly</name></author><category term="research" /><category term="hyung-wook-choi" /><category term="drexel" /><category term="cci" /><category term="phd-dsa" /><category term="michael-nelson" /><category term="2024" /><category term="layla-bouzoubaa" /><category term="steve-earth" /><category term="sonia-pascua" /><category term="katie-zellner" /><category term="aria-pessianzadeh" /><category term="lu-wang" /><category term="abass-ahmed" /><category term="edward-kim" /><category term="shadi-rezapour" /><category term="afsaneh-razi" /><category term="weimao-ke" /><category term="aleksandra-sarcevic" /><summary type="html"><![CDATA[PhD student and research faculty were offered an opportunity to present their research at the CCI First Annual Research Showcase]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://log.lab.matkelly.com/assets/2024-05-29/twitter-card.jpg?202405291713" /><media:content medium="image" url="https://log.lab.matkelly.com/assets/2024-05-29/twitter-card.jpg?202405291713" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Reflections on the IIPC General Assembly and Web Archiving Conference (WAC) 2024</title><link href="https://log.lab.matkelly.com/iipc-wac-2024" rel="alternate" type="text/html" title="Reflections on the IIPC General Assembly and Web Archiving Conference (WAC) 2024" /><published>2024-05-17T00:00:00+00:00</published><updated>2024-05-17T00:00:00+00:00</updated><id>https://log.lab.matkelly.com/iipc-wac-2024</id><content type="html" xml:base="https://log.lab.matkelly.com/iipc-wac-2024"><![CDATA[<p>The <a href="https://netpreserve.org/ga2024/">IIPC General Assembly and Web Archiving Conference 2024</a>, held at the <a href="https://www.bnf.fr/en/francois-mitterrand">François-Mitterrand site</a> of the <a href="https://www.bnf.fr/en">Bibliothèque nationale de France (BNF)</a>, is an annual gathering that brings together web archiving professionals, scholars, and enthusiasts from around the globe, fostering an environment of collaboration and innovation. 
<!-- endexcerpt --></p>

<figure style="float: right; width: 300px;">
<img src="/assets/2024-05-17/walkway.jpg" class="logo" />
<figcaption>The walkway to the entrance of the Fran&ccedil;ois-Mitterrand site.</figcaption>
</figure>

<h1 id="day-1-reserved-for-iipc-members">Day 1: Reserved for IIPC Members</h1>

<p>The conference began on April 24th with the General Assembly exclusively for IIPC members. The morning sessions included opening remarks and a chair address in the Grand Auditorium, followed by a detailed presentation of the IIPC Strategic Plan 2026-2031. This session was intended to outline the future directions and priorities for the consortium. However, as my institution is not a member of IIPC, I did not attend the day’s activities.</p>

<h1 id="day-2-pre-conference-workshops-and-networking">Day 2: Pre-Conference Workshops and Networking</h1>

<p><img src="/assets/2024-05-17/sawood-alam.jpeg" alt="Sawood Alam &gt;" class="mug" /></p>

<p><img src="/assets/2024-05-17/mark-phillips.jpeg" alt="Mark Phillips &gt;" class="mug" /></p>

<p>I joined the conference on April 25th for the pre-conference workshops. One of the workshops I attended was “Leveraging Parquet Files for Efficient Web Archive Collection Analytics,” led by Sawood Alam (<a href="https://x.com/ibnesayeed">@ibnesayeed</a>) and Mark Phillips (<a href="https://x.com/vphill">@vphill</a>). This session introduced innovative methods for handling large datasets efficiently, providing practical skills and insights that I found valuable.</p>

<p>The official start of the Web Archiving Conference followed the workshops. The keynote panel on Skyblog, the French pioneer of digital social networks, provided a historical perspective on the evolution of digital social networks.</p>

<p><img src="/assets/2024-05-17/unique-content.jpg" alt="Unique Content &gt;" class="fullwidth" /></p>

<p><br style="clear: both;" /></p>

<p><img src="/assets/2024-05-17/meghan-lyon.jpeg" alt="Meghan Lyon &gt;" class="mug" /></p>

<p>One of the highlights of the conference for me was Session 2, “Unique Content,” where I had the honor of presenting our paper titled “Saving Ads: Assessing and Improving Web Archives’ Holdings of Online Advertisements.” This session, chaired by Meghan Lyon (<a href="https://x.com/aquatic_archive">@aquatic_archive</a>) from the Library of Congress, featured diverse topics, from exploring thematic collections of street art to preserving digital artworks. Our presentation sparked engaging discussions on the challenges and strategies in archiving online advertisements, emphasizing the importance of such efforts in understanding digital culture and consumer behavior.</p>

<figure style="float: right;">
<img src="/assets/2024-05-17/hallway.jpg" class="fullwidth" />
<figcaption>The imposing hallway leading to the Petit Auditorium where I gave my presentation. Photo by Anders Klindt Myrvoll.</figcaption>
</figure>

<p>The beginning of the Unique Content session during which I gave my talk. 
<!--<br style="clear: both;">--></p>
<iframe src="/assets/2024-05-17/slides_iipcwac2024.pdf#view=fit" class="slides fullwidth" style="width: 100%; border: 0; height: 500px;"></iframe>

<h2 id="continued-learning-and-collaboration">Continued Learning and Collaboration</h2>

<p><img src="/assets/2024-05-17/ldko.jpeg" alt="Lauren Ko &gt;" class="mug" /></p>

<p>The afternoon sessions continued with Session 4, “Delivery &amp; Access,” chaired by Lauren Ko, and examined the challenges and solutions in ensuring access to archived web content. Workshops on browser-based crawling and quality assurance provided hands-on experiences and practical knowledge.</p>

<p>The day ended with a series of lightning talks that showcased innovative approaches and emerging technologies in web archiving. From generative AI to deduplication challenges, these talks highlighted the field’s dynamic and rapidly evolving nature.</p>

<h1 id="day-3-closing-with-vision">Day 3: Closing with Vision</h1>

<p>The final day, April 26th, continued with insightful panels and sessions. The panel on “Archiving Social Media in an Age of APIcalypse” was particularly relevant, addressing the current challenges posed by changes in social media APIs.</p>

<p>The closing keynote by Benoît Sagot (<a href="https://x.com/bensagot">@bensagot</a>) was a fitting end to the conference, providing a forward-looking perspective on the future of web archiving. The closing remarks encapsulated the essence of the conference—an event marked by learning, sharing, and envisioning the future of digital preservation.</p>

<h2 id="exploring-the-bnf-françois-mitterrand-and-richelieu-sites">Exploring the BNF: François-Mitterrand and Richelieu Sites</h2>

<p>The conference was hosted at the François-Mitterrand site of the BNF, a modern architectural marvel that stands as a testament to contemporary design and functionality. The BNF, as the national library of France, holds a pivotal role in preserving the nation’s literary and cultural heritage. Web archiving is an important part of its mandate, as is ensuring that digital content is captured and preserved for future generations.</p>

<p>An additional highlight was the opportunity to visit the historic Richelieu site, the original location of the BNF. Established in the 18th century, the Richelieu site has a rich history as the repository of France’s literary and cultural treasures. Its majestic reading rooms and extensive collections provide a glimpse into the past while continuing to serve researchers and scholars.</p>

<h1 id="reflections">Reflections</h1>

<p>Attending and presenting at the IIPC General Assembly and Web Archiving Conference 2024 was a remarkable experience. It was an opportunity to share our work, gain new insights, and forge connections with fellow professionals dedicated to preserving the digital heritage. The diverse sessions and workshops underscored the many faceted nature of web archiving and the importance of collaboration and continuous learning in this field.</p>

<h1 id="acknowledgement">Acknowledgement</h1>

<p>We would like to thank both the organizers of IIPC WAC for inviting us to attend the event and the Institute of Museum and Library Services (via <a href="https://www.imls.gov/grants/awarded/lg-252362-ols-22">grant #LG-252362-OLS-22</a>) for providing support for our travel.</p>]]></content><author><name>Christopher Rauch</name></author><category term="iipc" /><category term="wac" /><category term="iipc-wac" /><category term="christopher-rauch" /><category term="trip-report" /><category term="paris" /><category term="france" /><category term="sawood-alam" /><category term="mark-phillips" /><category term="lauren-ko" /><category term="meghan-lyon" /><category term="bnf" /><category term="imls" /><category term="web-ads" /><summary type="html"><![CDATA[Trip Report for the IIPC Web Archiving Conference (WAC) 2024 in Paris, France]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://log.lab.matkelly.com/assets/2024-05-17/twitter-card.jpg?202405171211" /><media:content medium="image" url="https://log.lab.matkelly.com/assets/2024-05-17/twitter-card.jpg?202405171211" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">MatLab 2023 Year-in-Review</title><link href="https://log.lab.matkelly.com/year-in-review-2023" rel="alternate" type="text/html" title="MatLab 2023 Year-in-Review" /><published>2024-01-02T00:00:00+00:00</published><updated>2024-01-02T00:00:00+00:00</updated><id>https://log.lab.matkelly.com/year-in-review-2023</id><content type="html" xml:base="https://log.lab.matkelly.com/year-in-review-2023"><![CDATA[<p>A year among years—this is the report of our research group’s 2023 relating to students, funding, dissemination, teaching, and service.
<!-- endexcerpt -->
<!--
add link to ed's course
TOADD: submitted, pending, failed proposals
TODO: add mention of YAMZ
TODO: spell check this document
TODO: add link to Edsu's course
-->For our research group, 2023 was eventful for both fortunate and unfortunate reasons. It is useful to review all that was encapsulated within the past twelve months for our perspective as well as to serve as a record of how we might improve in the future. This is the second in a series of yearly reviews (the first being the <a href="year-in-review-2022">review of 2022</a>).</p>

<p>In this summary of 2023, I cover aspects relating to <a href="#students">students</a>, <a href="#funding">funding</a>, <a href="#dissemination">dissemination</a>, <a href="#teaching">teaching</a>, <a href="#service">service</a>, <a href="#upcoming">upcoming endeavors</a>, and <a href="#personal">a personal aside</a>.</p>

<p>Before starting with our own, I want to first acknowledge others’ similar reports. This list will be amended inline as I become aware of them.</p>

<ul>
  <li>Michael Ekstrand (new Drexel CCI faculty) <a href="https://md.ekstrandom.net/blog/2023/12/2023">2023 in Review</a></li>
  <li>ODU WS-DL (PhD research group) <a href="https://ws-dl.blogspot.com/2024/04/2024-04-15-review-of-ws-dls-2023.html">Review of WS-DL’s 2023</a></li>
</ul>

<h1 id="students">Students</h1>

<p><img src="/assets/2022-12-30/christopher-rauch.jpeg" alt="Christopher Rauch &gt;" class="mug" /></p>

<p>Our research group is still composed of my two students, Christopher (Chris) Rauch (<a href="https://x.com/chris_2187">@chris_2187</a>) and <a href="https://choihywook.github.io/">Hyung Wook (Wook) Choi</a>. Chris is now researching part-time with <a href="https://www.cs.drexel.edu/~rw37/">Dr. Rosina Weber</a> (<a href="https://x.com/rosinaweber">@rosinaweber</a>) studying AI Ethics. We are hoping to move him to his Ph.D. candidacy proposal in the first half of 2024. He continues to lend his support and expertise to our IMLS-Web Ads grant (see <a href="#webads">below</a>).</p>

<p><br style="clear: both;" /></p>

<p><img src="/assets/2022-12-30/hyung-wook-choi.jpeg" alt="Hyung Wook Choi &gt;" class="mug" /></p>

<p>Wook is now in the second year of her Ph.D. as she continues to explore exploring semantic disambiguation across domains. In October 2023, she had a poster at the 17th International Conference on Metadata and Semantics Research (<a href="http://www.mtsr-conf.org/home">MTSR 2023</a>). More details on that are in the <a href="#dissemination">dissemination section</a> below.</p>

<h2 id="beyond-advising">Beyond Advising</h2>

<p>In support of students beyond those I advise, I was pleased to be involved on the committees and associated Ph.D. dissertation defenses of the now-Drs. Sam Grabus and Hanieh Razzaghi. Sam’s defense was titled, “Historical Subject Representation: An Analysis of Historical Vocabularies for Temporally-Aligned and Contextual Access Points” and Hanieh’s “Semantic Data Quality Assessment: An Investigation of Fitness for Use in Large Clinical Datasets”. Congratulations to them both.</p>

<p>Additionally, I was pleased to be involved in the Ph.D. candidacy review of Drexel CCI Ph.D. student Deanna Zarrillo (<a href="https://x.com/zarrillogical">@zarrillogical</a>), who was evaluated on the topics of computational social science, data policy &amp; ethics, and Science of Science.</p>

<h1 id="funding">Funding</h1>

<p><img src="/assets/2022-08-17-imls.jpeg" alt="IMLS Logo &gt;" class="logo" /></p>

<p><img src="/assets/2024-01-02/alex-poole.jpg" alt="Alex Poole, Drexel CCI collaborator &gt;" class="mug hideonmobile" /></p>

<p>In 2023, we continued our investigation of the <a href="https://www.imls.gov/">IMLS</a>-funded project titled “<a href="https://www.imls.gov/grants/awarded/lg-252362-ols-22">Saving Ads: Assessing and Improving Web Archives’ Holdings of Online Advertisements</a>”. In collaboration with Drs. <a href="https://weiglemc.github.io/">Michele C. Weigle</a> (<a href="https://x.com/weiglemc/">@weiglemc</a>) and <a href="https://www.cs.odu.edu/~mln/">Michael L. Nelson</a> (<a href="https://x.com/phonedude_mln">@phonedude_mln</a>) at <a href="https://www.odu.edu/">Old Dominion University</a>’s <a href="https://oduwsdl.github.io/">WS-DL Research Group</a>, Dr. Alex H. Poole from <a href="https://drexel.edu/cci/">Drexel CCI</a>, and student support from Chris and Ph.D. student <a href="https://sites.google.com/view/treid803/home">Travis Reid</a> (<a href="https://x.com/TReid803">@TReid803</a>), we were able to explore some of the nuances of the past preservation and contemporary state of archiving advertisements on the web. A report of the first year of our findings as well as a pending follow-up investigation is described in the <a href="#upcoming">Upcoming section</a> below.</p>

<p><img src="/assets/2024-01-02/nsf.png" alt="NSF Logo &gt;" class="logo" /></p>

<p>Beyond the IMLS funding, I have been grateful to be involved in the NSF Science of Science: Discovery, Communication, and Impact (SoS:DCI)-supported project, “Examining the effects of academic mobility on individual professors’ research activity and institutional human capital at HBCUs” with members from the Drexel, UT-Knoxville, UW-Madison, and Howard University research communities. In 2023, this project wrapped up the data collection phase has started to analyze the trends of academic migration with respect to HBCUs. We had two such presentations from this project this year (see <a href="#dissemination">below</a>).</p>

<p><img src="/assets/2022-12-30/leading.png" alt="LEADING Logo &gt;" class="logo" /></p>

<p>Lastly, we wrapped up the <a href="https://mrc.cci.drexel.edu/leading/fellows/">final cohort</a> of the IMLS-funded <a href="https://mrc.cci.drexel.edu/leading/">LIS Education And Data Science Integrated Network Group (LEADING)</a> project in December 2023. An overview of “Next Steps…” for the project was presented at the <a href="https://www.cni.org/events/membership-meetings/past-meetings/fall-2023">Coalition for Networked Information (CNI) Fall 2023 Membership Meeting</a>.</p>

<h1 id="dissemination">Dissemination</h1>

<p>While our focus this year was on achieving sustainability of student support through grant writing, we continued to have a productive year for publications and research dissemination.</p>

<ul>
  <li>An article detailing ODU MS student John Berlin’s thesis was published and released in the <a href="https://dl.acm.org/doi/10.1145/3589206">TWEB Journal</a>.</li>
  <li>I was involved in a journal article with Dr. Jane Greenberg (<a href="https://x.com/all_metadata">@all_metadata</a>)) in the Data Intelligence journal titled, <a href="https://direct.mit.edu/dint/article/5/1/242/115158/Building-Community-Consensus-for-Scientific">Building Community Consensus for Scientific Metadata with YAMZ</a>.</li>
  <li>Wook and I had a poster at MTSR 2023 in Milan, Italy titled, “<a href="https://arxiv.org/pdf/2310.12369.pdf">On Identifying Points of Semantic Shift Across Domain</a>” (linked preprint)</li>
  <li><a href="https://www.cs.drexel.edu/~ey86/">Erjia Yan</a> (<a href="https://x.com/erjiayan">@erjiayan</a>), Deanna Zarrillo, me, and others on the NSF-Funded project had a publication at <a href="https://cns-iu.github.io/workshops/2023-07-02_issi/">the 20th International Conference of the International Society for Scientometrics and Informetrics (ISSI)</a> titled, “<a href="https://matkelly.com/papers/2023_issi_yan.pdf">Examining the academic mobility at Historically Black Colleges and Universities in the U.S.</a>”.</li>
  <li>I also presented some preliminary findings from this project at the <a href="https://netpreserve.org/ga2023/">International Internet Preservation Consortium (IIPC) Web Archiving Conference (WAC) 2023</a> titled, “<a href="https://matkelly.com/presentations/2023_iipcwac_braindrain.pdf">Using Web Archives to Model Academic Migration and Identify Brain Drain</a>” (slides linked).</li>
  <li>Lastly, I lead an effort to hold the <a href="https://ieeexplore.ieee.org/document/10266131">Web Archiving and Digital Libraries (WADL) Workshop at JCDL 2023</a> this year with the help of <a href="https://reyesayala.github.io/">Brenda Reyes Ayala</a> (<a href="https://x.com/CamtheWicked/">@CamtheWicked</a>), <a href="https://www.zhiwuxie.com/">Zhiwu Xie</a> (<a href="https://x.com/zxie/">@zxie</a>), <a href="https://fox.cs.vt.edu/">Ed Fox</a> (<a href="https://x.com/edwardafox/">@edwardafox</a>) and all of the attendants in Santa Fe, New Mexico. More information on this event is described in the <a href="#travel">Travel</a> section.</li>
</ul>

<h1 id="teaching">Teaching</h1>

<p>Drexel is on the quarter system, which as tenure-track faculty, I teach three classes per year: one in each of Fall, Winter, and Spring. For this year-in-review, I will focus on the Winter (January-March), Spring (March-June), and Fall (September-December) 2023 quarters.</p>

<p>To start the year, I taught INFO102 - Introduction to Information Systems. Even as one with a computer science background, much of the content was fundamental to information systems, data handling, collaboration, and ethics. Inheriting a course shell on the material proved useful but even for the sizeable undergraduate class, there seemed to be minimal enthusiasm.</p>

<p>Spring quarter had me teaching INFO202 - Data Curation for the third time in my career at Drexel. With some previous evolution of the course by <a href="https://www.cs.drexel.edu/~ey86/">Dr. Erjia Yan</a> loosely based on <a href="https://inkdroid.org/">Ed Summers’</a> Digital Curation course at UMD. The adaptation was going swimmingly, and the students seemed to prefer interactively working with data over ingesting lecture material. At one point during the term, a personal caveat (see <a href="#personal">below</a> if interested) caused a hiccup in the presentation of this course. I was able to return to instruction with one final data-based assignment before wrapping up the school year for an unfortunately eventful Summer. I still owe Ed an email about these adaptations. There was a plan to have a separate blog post on the updates (and perhaps there eventually will be) but life got in the way of this instance.</p>

<p><img src="/assets/2024-01-02/elastic.png" alt="Elasticsearch Logo &gt;" class="logo" /></p>

<p>In the Fall quarter of 2023, I taught INFO624: Information Retrieval Systems to a cross-listed course of 24 students, 12 of whom were in-person at Drexel CCI and 12 online. This was the fourth time I taught the course and incorporated some newer assignment material where students had a chance to become more familiar with <a href="https://www.elastic.co/elasticsearch">Elasticsearch</a> through implementing concepts described in class.</p>

<h1 id="travel">Travel</h1>

<p><img src="/assets/2024-01-02/santafe_cathedral.jpg" alt="Cathedral Basilica of St. Francis of Assisi in Santa Fe, NM &gt;" class="fullwidth" /></p>

<p><img src="/assets/2022-12-30/jcdl-2023.png" alt="JCDL 2023 Logo &gt;" class="logo" /></p>

<p>I had two travel events this year – one to the <a href="https://2023.jcdl.org/">ACM/IEEE Joint Conference on Digital Libraries (JCDL) 2023</a> in Santa Fe, New Mexico (where I was a <a href="https://2023.jcdl.org/organizers/organizing-committee/#:~:text=Alamos%20National%20Laboratory-,Program%20Chairs,-Anat%20Ben%2DDavid">conference program chair</a>) (see <a href="https://ws-dl.blogspot.com/2023/07/2023-07-26-acmieee-joint-conference-on.html">ODU WS-DL’s Trip Report</a>) and one to Birmingham, UK to the <a href="https://uobevents.eventsair.com/cikm2023/">ACM Conference on Information and Knowledge Management (CIKM 2023)</a> where I was the publicity chair (<a href="https://x.com/cikm2023">@cikm2023</a>) for the conference.</p>

<p><br style="clear: both;" /></p>

<p><img src="/assets/2024-01-02/brum_greathall.jpg" alt="Great Hall at University of Birmingham &gt;" class="fullwidth" /></p>

<p><img src="/assets/2024-01-02/cikm.jpeg" alt="CIKM 2023 Logo &gt;" class="logo" /></p>

<p>Both were great opportunities to catch up with former colleagues, learn more about the  workings of an in-person conference, and interface with researchers investigating contemporary topics in their respective fields. Along with being an organizer at JCDL, I was also the lead in the <a href="https://fox.cs.vt.edu/wadl2023.html">WADL Workshop</a> while there in person. We hosted a hybrid audience and presenters with self-admitted success and hope to hold the event again in the future. The fine folks at ODU WS-DL also have provided a report on <a href="https://ws-dl.blogspot.com/2023/08/2023-08-10web-archiving-and-digital.html">their experience of the workshop</a>.</p>

<p><br style="clear: both;" /></p>

<h1 id="service">Service</h1>

<p>For disclosure, I prefer to be transparent about my service involvement in the various communities with which I have been involved over the year. In 2023, I was involved in the conference review process for <a href="https://websci23.webscience.org/">ACM WebSci 2023</a>, <a href="https://ipres2023.us/">iPRES 2023</a>, <a href="https://icadl.net/icadl2023/">ICADL 2023</a>, and <a href="https://www.ischools.org/iconference">iConference 2024</a>. I was also a reviewer for <a href="https://www.tandfonline.com/journals/rint20">Internet Histories</a> and the <a href="https://www.sciencedirect.com/journal/softwarex">SoftwareX</a> journals. My involvement with the <a href="https://link.springer.com/journal/799">International Journal on Digital Libraries (IJDL)</a> continues as <a href="https://link.springer.com/journal/799/editors">a managing editor</a>. Additionally, in 2023, I was also a funding reviewer for the <a href="https://www.nwo.nl/en">Dutch Research Council</a>.</p>

<p>For internal service to the Information Science department at Drexel CCI, I continue to assist with the MSIS Curriculum Committee, of which the course (INFO655) I am <a href="#teaching">teaching</a> this winter (starting January 2024) is integrated.</p>

<h1 id="upcoming">Upcoming</h1>

<p><img src="/assets/2024-01-02/iipcwac-2024.png" alt="IIPC WAC Logo &gt;" class="logo" /></p>

<p>In June 2024, we will be presenting our conference submission, “Saving Ads: Assessing and Improving Web Archives’ Holdings of Online Advertisements” at the IIPC Web Archiving Conference in Paris, France.</p>

<p><img src="/assets/2024-01-02/ijdl.png" alt="IJDL Logo &gt;" class="logo" /></p>

<p>An invited, extended version of my <a href="https://icadl.net/icadl2022/">ICADL 2022</a> paper titled, “<a href="https://doi.org/10.1007/978-3-031-21756-2_25">Aggregator Reuse and Extension for Richer Web Archive Interaction</a>” has been accepted, revised, and will appear in a volume of the <a href="https://link.springer.com/journal/799">International Journal on Digital Libraries (IJDL)</a> with the title, “Exploiting the Untapped Functional Potential of Memento Aggregators Beyond Aggregation”.</p>

<p><img src="/assets/2024-01-02/react.png" alt="React &gt;" class="logo" /></p>

<p>In 2024, I will be teaching an overhauled version of INFO655: Intro to Web Programming that will incorporate the newer paradigms involved in programming web applications using <a href="https://react.dev/">React</a>.</p>

<p>Lastly, regarding teaching, I am slated to again teach INFO202: Data Curation. I am hoping to complete the first instance of this course without any caveat (see <a href="#personal">this year</a>’s below).</p>

<h1 id="personal">Personal</h1>

<p>Ebbs and flows this year. Some personal matters caused personal and professional hiccups this year. I am detailing them here for my record and closure for anyone who clicked the mention above.</p>

<p><img src="/assets/2024-01-02/benjamin.jpg" alt="Benjamin &gt;" class="logo" /></p>

<p>While halfway through the term of teaching INFO202, on May 14, 2023, we took our 1-year-old Benjamin (introduced in the <a href="https://log.lab.matkelly.com/year-in-review-2022#:~:text=in%20future%20iterations.-,In%20Spring%202022,-%2C%20I%20was%20on">Teaching Section of the 2023 Year-in-Review</a> to the emergency room where he was diagnosed with Type 1 diabetes. Recently graduate Ph.D. student, now Dr. Deborah Garwood, was able to substitute in and teach multiple lectures of the course in its original form. For that I am grateful.</p>

<p>Nearly immediately after returning from JCDL 2023 in Santa Fe in July, I was hospitalized with severe chest pain that I thought surely was my end and part of the motivation for publishing a report like this, even if the mention was a subtext. It was chalked up to coronary vasospasm, of which I have no history. I am still uncertain if it has any relation to the travel.</p>

<p><img src="/assets/2024-01-02/yuri.jpg" alt="Yuri, Space Dog &gt;" class="logo" /></p>

<p>As if July was not already punishing enough, on July 8th our finest canine <a href="https://matkelly.com/spaceDog">Yuri, Space Dog</a>, died after a long bout with a progressing cancer. He gave us over ten of his years was a good boy.</p>

<p><img src="/assets/2024-01-02/grandma.jpg" alt="Dr. Sean Kelly (brother), Eileen Kelly (grandma), Dr. Mat Kelly (me) &gt;" class="logo" /></p>

<p>And lastly, as the year was coming to a close, my last remaining grandparent, Eileen Kelly of Pine Bush, New York passed away on December 18, 2023. This is where I put this post to a close.</p>

<p><br clear="both" /></p>

<h1 id="in-summary">In Summary</h1>

<ul>
  <li>Two journal articles published</li>
  <li>Two conference presentations given</li>
  <li>Three courses taught</li>
  <li>Two grants progressed</li>
  <li>One workshop organized and run</li>
  <li>Two conferences attended in person</li>
  <li>One-and-one-half students advised</li>
  <li>One heart attack</li>
</ul>

<h1 id="acknowledgments">Acknowledgments</h1>

<p>I want to thank the universe for not putting an end to me in July and allowing me time to put this post together. I am hoping for this yearly to be more eventful in a positive light and to produce a more optimistic year-in-review post the end of 2024.</p>

<p>The photos in this post are original captures and should be considered public domain. For completeness, they are from Forksville, Pennsylvania in August 2023 (post header); the Cathedral of St. Francis of Assisi in Santa Fe, New Mexico (JCDL 2023) in June 2023 ; and The Great Hall at University of Birmingham in Birmingham, UK (CIKM 2023) from October 2023. Please feel free to email me, DM me, or <a href="https://github.com/machawk1/log">submit a PR</a> if you notice any errors or omissions. I will acknowledge them here.</p>]]></content><author><name>Mat Kelly</name></author><category term="year-in-review" /><category term="hyung-wook-choi" /><category term="christopher-rauch" /><category term="erjia-yan" /><category term="deanna-zarrillo" /><category term="imls" /><category term="nsf" /><category term="sam-grabus" /><category term="hanieh-razzaghi" /><category term="odu" /><category term="jcdl" /><category term="wadl" /><category term="iipc-wac" /><category term="cikm" /><category term="tweb" /><category term="info624" /><category term="info102" /><category term="info202" /><summary type="html"><![CDATA[A year among years, this is the report of our research group's 2023 relating to students, funding, dissemination, teaching, and service.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://log.lab.matkelly.com/assets/2024-01-02/twitter-card2.jpg" /><media:content medium="image" url="https://log.lab.matkelly.com/assets/2024-01-02/twitter-card2.jpg" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">MatLab 2022 Year-in-Review</title><link href="https://log.lab.matkelly.com/year-in-review-2022" rel="alternate" type="text/html" title="MatLab 2022 Year-in-Review" /><published>2022-12-30T00:00:00+00:00</published><updated>2022-12-30T00:00:00+00:00</updated><id>https://log.lab.matkelly.com/year-in-review-2022</id><content type="html" xml:base="https://log.lab.matkelly.com/year-in-review-2022"><![CDATA[<p>It was eventful year! In this post we detail news relating to students, funding, dissemination, and courses that occurred in 2022.
<!-- endexcerpt -->
It has taken three years of my association with <a href="https://drexel.edu/">Drexel</a> but in 2022 I am happy to finally provide a report of the status of our efforts over the last year.</p>

<h1 id="students">Students</h1>

<p><img src="/assets/2022-12-30/christopher-rauch.jpeg" alt="Christopher Rauch &gt;" class="mug" /></p>

<p>In 2022, PhD Student Christopher Rauch (<a href="https://twitter.com/chris_2187">@chris_2187</a>) is continuing to explore his research topic and will be the primary Drexel student on our <a href="#funding">recently awarded IMLS grant</a>.</p>

<p><br style="clear: both;" /></p>

<p><img src="/assets/2022-12-30/hyung-wook-choi.jpeg" alt="Hyung Wook Choi &gt;" class="mug" /></p>

<p>A new PhD student, <a href="https://choihywook.github.io/">Hyung Wook Choi</a>, has joined Chris and I to pursue her PhD, initially exploring semantic disambiguation across domains. Upon starting her trek at Drexel, she was also awarded the <a href="https://drexel.edu/graduatecollege/research-funding/fellowships/graduatecollege-fellowships/2022-fellows/">Graduate College Doctoral Gold Fellowship</a>.</p>

<h1 id="funding">Funding</h1>

<p><img src="/assets/2022-12-30/justin-f-brunelle.jpeg" alt="Justin F. Brunelle &gt;" class="mug" /></p>

<p>After a long period of negotiation, we were ability to solidify a collaboration between Drexel CCI and MITRE Corporation to build upon the “VENOM: Archiving the Dark Web” project. Led by <a href="http://justinfbrunelle.com/">Dr. Justin F. Brunelle</a> on the MITRE end and myself from Drexel, we sought to investigate methods for associating dark web URIs together and be able to provide a persistent identifier to solidify the association.</p>

<p><img src="/assets/2022-08-17-imls.jpeg" alt="IMLS Logo &gt;" class="logo" /></p>

<p>In late July 2022, I was information that our grant proposal to the <a href="https://www.imls.gov/">Institute of Museum and Library Services (IMLS)</a> relating to investigating the past and contemporary usage of ads on the web was funded! This collaboration with my PhD advisors, Drs. <a href="https://weiglemc.github.io/">Michele C. Weigle</a> (<a href="https://twitter.com/weiglemc/">@weiglemc</a>) and <a href="https://www.cs.odu.edu/~mln/">Michael L. Nelson</a> (<a href="https://twitter.com/phonedude_mln">@phonedude_mln</a>) from <a href="https://www.odu.edu/">ODU</a>, as well as my colleague, <a href="https://drexel.edu/cci/about/directory/P/Poole-Alex/">Dr. Alex H. Poole</a> from Drexel CCI will support both Drexel and ODU students until 2024 to execute the grant tasks. More information about the grant can be had on the <a href="https://www.imls.gov/grants/awarded/lg-252362-ols-22">grant summary page</a>.</p>

<h1 id="dissemination">Dissemination</h1>

<p><img src="/assets/2022-12-30/asist.jpeg" alt="ASIST Logo &gt;" class="logo" /></p>

<p>On the dissemination front, Deanna Zarrillo (<a href="https://twitter.com/zarrillogical">@zarrillogical</a>) from CCI’s <a href="https://www.cs.drexel.edu/~ey86/lab.html">Scholarly Communication Lab</a> presented our work on an NSF Science of Science: Discovery, Communication, and Impact grant; lead by CCI’s <a href="https://www.cs.drexel.edu/~ey86/">Dr. Erjia Yan</a> (<a href="https://twitter.com/erjiayan">@erjiayan</a>), at the <a href="https://www.asist.org/am22/">Association for Information Science and Technology (ASIS&amp;T) Annual Meeting 2022</a> in Pittsburgh, PA. The <a href="https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/pra2.664">paper</a>, “Collecting Diachronic Affiliation Data for Faculty at HBCUs Using Memento”, described some our efforts at exploring the phenomena of Brain Drain from Historically Black Colleges and Universities (HBCUs) by initially leveraging web archives.</p>

<p><img src="/assets/2022-12-30/isko-2022.jpeg" alt="ISKO 2022 Logo &gt;" class="logo" /></p>

<p>While still completing her MS prior to joining the PhD program, Hyung Wook Choi and I published a poster, “Examining Existing Approaches Toward Semantic Disambiguation” at the International Society for Knowledge Organization (ISKO) Conference 2022 in Aalborg, Denmark in July.</p>

<p><img src="/assets/2022-12-30/jcdl-2022.jpeg" alt="JCDL 2022 Logo &gt;" class="logo" /></p>

<p>In June 2022 at the ACM/IEEE Joint Conference on Digital Libraries 2022, I helped organize the <a href="https://fox.cs.vt.edu/wadl2022.html">Web Archiving and Digital Libraries (WADL) Workshop</a> <a href="https://matkelly.com/papers/2022_wadl.pdf">write-up</a> along with Martin Klein (<a href="https://twitter.com/mart1nkle1n">@mart1nkle1n</a>) from <a href="https://www.lanl.gov/">Los Alamos National Laboratory</a> and both <a href="https://www.zhiwuxie.com/">Zhiwu Xie</a> (<a href="https://twitter.com/zxie">@zxie</a>) and <a href="https://fox.cs.vt.edu/">Edward A. Fox</a> from <a href="https://www.vt.edu/">Virginia Tech</a>. At this workshop, we also presented our progress on the above NSF grant with a presentation titled, “<a href="https://matkelly.com/papers/2022_wadl_hbcus.pdf">First steps in Identifying Academic Migration using Memento and Quasi-Canonicalization</a>”.</p>

<p><img src="/assets/2022-12-30/icadl-2022.png" alt="ICADL 2022 Logo &gt;" class="logo" /></p>

<p>Finally, less than a month ago, I remotely presented my full paper, “<a href="https://matkelly.com/papers/2022_icadl_chaining.pdf">Aggregator Reuse and Extension for Richer Web Archive Interaction</a>” at the <a href="https://icadl.net/icadl2022/">International Conference on Asia-Pacific Digital Libraries (ICADL 2022)</a>, which was held in Hanoi, Vietnam. This is the first I have published after this conference, now having completed the DL conference trifecta of JCDL, TPDL, and ICADL.</p>

<h1 id="teaching">Teaching</h1>

<p>In Winter 2022, I was grateful to be able to teach INFO624 Information Retrieval System for the third time at Drexel. I very much enjoy the course and hope to further incorporate interaction with Elasticsearch in future iterations.</p>

<p><img src="/assets/2022-12-30/benjamin.jpeg" alt="Benjamin Patrick &gt;" class="mug" /></p>

<p>In Spring 2022, I was on research leave after having welcomed my son to the world on March 14, 2022.</p>

<p>Finally, in Fall 2022, I recently finished teaching INFO600 Web Systems &amp; Architecture. This is also my third time teaching this class with each iteration being a different modality permutation: first as face-to-face, second as online due to the pandemic, and thirdly as a cross-listed online/face-to-face course.</p>

<h1 id="upcoming">Upcoming</h1>

<p><img src="/assets/2022-12-30/jcdl-2023.png" alt="JCDL 2023 Logo &gt;" class="logo" /></p>

<p>In 2023, I will be involved as a Program Chair in helping to organize the <a href="https://2023.jcdl.org/">ACM/IEEE Joint Conference on Digital Libraries 2023</a> in Sante Fe, New Mexico along with Dr. Martin Klein of LANL, <a href="https://www.ibi.hu-berlin.de/de/institut/personen/jaeschke">Dr. Robert Jäschke</a> of Humboldt-Universität zu Berlin, and <a href="https://www.anatbendavid.info/">Dr. Anat Ben-David</a> (<a href="https://twitter.com/anatbd">@anatbd</a>) of <a href="https://www.openu.ac.il/en/pages/default.aspx">Open University of Israel</a>. It is sure to be an exciting time come next June and I am looking forward to continually being involved in this conference in a new role.</p>

<p><img src="/assets/2022-12-30/cikm-2023.png" alt="CIKM 2023 Logo &gt;" class="logo" /></p>

<p>Additionally, at the invite of <a href="http://www.frommholz.org/">Dr. Ingo Frommholz</a> (<a href="https://twitter.com/iFromm">@iFromm</a>), I will be serving as the Publicity Chair on the Organizing Committee of the <a href="https://cikm2023.github.io/">ACM International Conference on Information and Knowledge Management (CIKM) 2023</a>. More info to come on that conference as we move into the new year.</p>

<p><img src="/assets/2022-12-30/iipcwac-2023.png" alt="IIPC WAC Logo &gt;" class="logo" /></p>

<p>Lastly, just prior to posting this review in early December I was notified of the acceptance of our proposal, “Using Web Archives to Model Academic Migration and Identify Brain Drain” at the <a href="https://netpreserve.org/ga2022/">International Internet Preservation Consortium (IIPC) Web Archiving Conference (WAC) 2023</a>, to occur in May 2023. More information on this presentation disseminating progress on our <a href="#funding">NSF-funded</a> grant will be available in the future.</p>

<h1 id="ongoing">Ongoing</h1>

<p>I am happy to continue my involvement with a few endeavors.</p>

<p><img src="/assets/2022-12-30/leading.png" alt="LEADING Logo &gt;" class="logo" /></p>

<p>For starters, I am still very much involved with Dr. Jane Greenberg’s IMLS-funded <a href="https://mrc.cci.drexel.edu/leading/">LIS Education And Data Science Integrated Network Group (LEADING)</a> project where, in 2022, we had our second cohort of students.</p>

<p><img src="/assets/2022-12-30/ijdl.png" alt="IJDL Logo &gt;" class="logo" /></p>

<p>I am also grateful to continue to be involved as a Managing Editor for the <a href="https://www.springer.com/journal/799">International Journal on Digital Libraries (IJDL)</a>, where I am focusing on novel approaches toward publicity of the journal for soliciting new submissions. The Managing Editors, the Editorial Board, and a representative from Springer will meet on January 9, 2023 to discuss the state and future efforts for the journal.</p>

<h1 id="in-summary">In Summary</h1>

<ul>
  <li>One new PhD student (Wook)</li>
  <li>One peer-reviewed full paper (ICADL)</li>
  <li>One peer-reviewed short paper (ASIS&amp;T)</li>
  <li>One poster (ISKO)</li>
  <li>One workshop publication (WADL)</li>
  <li>One workshop organized (WADL)</li>
  <li>Two new grant awarded (IMLS NLG-L, MITRE-Venom) with the latter fully executed</li>
  <li>Two courses taught (INFO624, INFO600)</li>
</ul>

<h1 id="acknowledgments">Acknowledgments</h1>

<p>The barriers in publishing a yearly review are rigid and tough to overcome given the many other obligations of academia. For finally getting this post out-the-door I want to thank the motivation of <a href="https://jinahuhyoo.com/">Dr. Jina Huh-Yoo</a>, who has managed to publish <a href="https://jinahuhyoophd.medium.com/2021-year-in-review-health-and-happiness-design-lab-at-drexel-university-33971bb5da14">her year-in-review while at Drexel</a>, and also the <a href="https://ws-dl.cs.odu.edu">ODU WS-DL research group</a>, who continually publishes <a href="https://ws-dl.blogspot.com/search/label/WS-DL%20Review">such posts</a> each January.</p>]]></content><author><name>Mat Kelly</name></author><category term="year-in-review" /><category term="hyung-wook-choi" /><category term="christopher-rauch" /><category term="erjia-yan" /><category term="deanna-zarrillo" /><category term="imls" /><category term="nsf" /><category term="cikm" /><category term="jcdl" /><category term="iipcwac" /><category term="leading" /><category term="mitre" /><summary type="html"><![CDATA[It was eventful year! In this post we detail news relating to students, funding, dissemination, and courses that occurred in 2022.]]></summary><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="https://log.lab.matkelly.com/assets/2022-12-30/2022-12-30-yir.jpeg" /><media:content medium="image" url="https://log.lab.matkelly.com/assets/2022-12-30/2022-12-30-yir.jpeg" xmlns:media="http://search.yahoo.com/mrss/" /></entry><entry><title type="html">Collaborative Study Highlighting the Importance of Web Ads Funded by IMLS</title><link href="https://log.lab.matkelly.com/imls-webads" rel="alternate" type="text/html" title="Collaborative Study Highlighting the Importance of Web Ads Funded by IMLS" /><published>2022-08-17T00:00:00+00:00</published><updated>2022-08-17T00:00:00+00:00</updated><id>https://log.lab.matkelly.com/imls-webads</id><content type="html" xml:base="https://log.lab.matkelly.com/imls-webads"><![CDATA[<p class="crosspost">This write-up was originally posted to the <a href="https://ws-dl.blogspot.com/2022/08/2022-08-17-collaborative-study.html">ODU WS-DL blog</a>.</p>

<p><img src="/assets/2022-08-17-imls.jpeg" alt="IMLS Logo &gt;" width="350" /></p>

<p>Our IMLS NLG-L grant, “Saving Ads: Assessing and Improving Web Archives’ Holdings of Online Advertisements” has been selected for funding!</p>

<p>We are pleased to announce that a new collaboration between <a href="https://drexel.edu/cci/">Drexel University College of Computing &amp; Informatics</a> and the <a href="https://ws-dl.cs.odu.edu/">ODU Web Science and Digital Libraries (WS-DL) Research Group</a> has been funded by the <a href="https://www.imls.gov/">Institute of Library and Museum and Library Services (IMLS)</a> for the amount of $149,479. The two-year project, “<strong>Saving Ads: Assessing and Improving Web Archives’ Holdings of Online Advertisements</strong>” is led by WS-DL alumnus <a href="https://matkelly.com/">Mat Kelly</a> with WS-DL’s <a href="https://www.cs.odu.edu/~mln/">Michael L. Nelson</a> and <a href="https://www.cs.odu.edu/~mweigle/">Michele C. Weigle</a> and Drexel CCI’s <a href="https://drexel.edu/cci/about/directory/P/Poole-Alex/">Alex Poole</a> as co-investigators.</p>

<p>This work will focus on the preservation of online advertisements in the past and help to inform methods going forward. Online ads have a similar, if not great cultural significance as print advertisements. For example, embedded ads for masks since the beginning of the COVID-19 pandemic in Spring 2020 depict social norms of a time in much of the same way as ads for Camel Cigarettes did in 1946. However, major public web archives are failing to capture many embedded ads in their archived pages.</p>

<figure>
  <img src="https://log.lab.matkelly.com/assets/2022-08-17-ad.jpeg" alt="Contemporary advertisements on the web are indicators of cultural significance much like those from print media of the past." />
  <figcaption>Contemporary advertisements on the web are indicators of cultural significance much like those from print media of the past.</figcaption>
</figure>

<p>Contemporary advertisements on the web are indicators of cultural significance much like those from print media of the past.
We have proposed to study the gap by analyzing the need for and feasibility of archiving advertisements that are embedded in web pages. This will entail an assessment using mixed methods to learn what aspects of online ads future scholars might be interested in studying.</p>

<p>Through this two-year project, we will produce two data sets of online advertisements and their archived contexts to be used for further research. We will also produce a quantitative baseline for which sorts of ads were or were not previously archives and provide a qualitative assessment of the significance of the missing ads. We anticipate this work to be the basis for future larger scale studies to highlight the cultural impact that online advertisements have had in the past and will in the future.</p>

<p>For additional information on this project, please <a href="https://www.imls.gov/grants/awarded/lg-252362-ols-22">see our detailed project narrative</a> made publicly available by IMLS. We are grateful for the support of this project by IMLS and looking forward to widely disseminating the results in the future.</p>]]></content><author><name>Mat Kelly</name></author><category term="funding" /><category term="imls" /><category term="webads" /><category term="wsdl" /><category term="michael-nelson" /><category term="michele-weigle" /><category term="alex-poole" /><summary type="html"><![CDATA[Our IMLS NLG-L grant on assessing and improving web archives' holdings of online advertisements has been selected for funding!]]></summary></entry></feed>