Rogue Scholar Digest March 6, 2024

digest

This is a summary of the Rogue Scholar blog posts published February 28 - March 5, 2024.

Author

Affiliation

Martin Fenner

Front Matter

Published

March 6, 2024

Code

import requests
import locale
import re
from typing import Optional
import datetime
from IPython.display import Markdown

locale.setlocale(locale.LC_ALL, "en_US")
baseUrl = "https://api.rogue-scholar.org/"
published_since = "2024-02-28"
published_until = "2024-03-06"
feature_image = 0
include_fields = "title,authors,published_at,summary,blog_name,blog_slug,doi,url,image"
url = (
    baseUrl
    + f"posts?&published_since={published_since}&published_until={published_until}&language=en&sort=published_at&order=asc&per_page=50&include_fields={include_fields}"
)
response = requests.get(url)
result = response.json()


def get_post(post):
    return post["document"]


def format_post(post):
    doi = post.get("doi", None)
    url = f"[{doi}]({doi})\n<br />" if doi else ""
    title = f"[{post['title']}]({doi})" if doi else f"[{post['title']}]({post['url']})"
    published_at = datetime.datetime.utcfromtimestamp(post["published_at"]).strftime(
        "%B %-d, %Y"
    )
    blog = f"[{post['blog_name']}](https://rogue-scholar.org/blogs/{post['blog_slug']})"
    author = ", ".join([f"{x['name']}" for x in post.get("authors", None) or []])
    summary = post["summary"]
    return f"### {title}\n{url}Published {published_at} in {blog}<br />{author}<br />{summary}\n"


posts = [get_post(x) for i, x in enumerate(result["hits"])]
posts_as_string = "\n\n".join([format_post(x) for x in posts])

def doi_from_url(url: str) -> Optional[str]:
    """Return a DOI from a URL"""
    match = re.search(
        r"\A(?:(http|https)://(dx\.)?(doi\.org|handle\.stage\.datacite\.org|handle\.test\.datacite\.org)/)?(doi:)?(10\.\d{4,5}/.+)\Z",
        url,
    )
    if match is None:
        return None
    return match.group(5).lower()

images = [x["image"] for x in posts if x.get("image", None) is not None]
image = images[feature_image]
markdown = f"![]({image})\n\n"
markdown += posts_as_string
Markdown(markdown)

Nigersaurus in the mist

https://doi.org/10.59350/gstye-11z15
Published February 28, 2024 in Sauropod Vertebra Picture of the Week
Matt Wedel
The best thing about collaborations is that I get to work with amazing colleagues from around the world.

re3data Call for Editorial Board

https://doi.org/10.59350/d7b07-96r15
Published February 28, 2024 in re3data COREF Project Blog
re3data Team
re3data Call for Editorial Board The re3data.org registry has been in operation for over 10 years and provides a curated index of over 3,000 research data repositories around the world from all disciplines.

Preservation as a Memento Mori and Matter of Ethics

https://doi.org/10.59348/jggmv-9rp63
Published February 29, 2024 in Martin Paul Eve
Martin Paul Eve
When training PhD students and other postgraduates, we often have a section on publication. It covers the basic background process, the rudiments of peer review, and what to expect. There is often, also, a lengthy discussion of ensuring that you publish in the place that will do most for your career. What is the prestige status of the journal, for instance?

Help users and developers of targets by answering questions!

https://doi.org/10.59350/pbm5y-w5540
Published February 29, 2024 in rOpenSci - open tools for open science
Will Landau, Maëlle Salmon
The package targets maintained by Will Landau, and its companion packages, are pipeline tools, that coordinate the pieces of computationally demanding analysis projects.Help sustain targets discussion forums!

Help make qualtRics better! Code contributions wanted

https://doi.org/10.59350/dtkg3-48502
Published March 1, 2024 in rOpenSci - open tools for open science
Julia Silge, Joseph O’Brien, Maëlle Salmon
The package qualtRics maintained by Julia Silge together with Joseph O’Brien provides functions to access survey results directly into R using the Qualtrics API. Qualtrics is an online survey and data collection software platform.Help test or improve qualtRics! Are you a heavy user of the Qualtrics survey tooling in general, and of the qualtRics R package in particular? Then you can help build and test the package.How to help?

Reflect, Enjoy, Do something! Lessons from the International Digital Curation Conference

https://doi.org/10.59350/rw1gx-8z853
Published March 1, 2024 in GigaBlog
Chris Hunter
Kevin Ashley’s (DCC director) closing remark at the IDCC (International Digital Curation Conference) 2024 meeting was “ Reflect, Enjoy, Do something! ”. Hopefully you will enjoy my doing something.

Proposed Reforms to the American Board of Surgery In-Training Examination (ABSI

https://doi.org/10.59350/qhkj3-3qs43
Published March 1, 2024 in I.D.E.A.S.
Tejas S. Sathe
This is a preprint of a manuscript currently under peer review. The American Board of Surgery In-Training Examination (ABSITE ^® ) is a multiple-choice exam administered yearly to surgical residents. The ABS provides the following information on its website: “The ABSITE is furnished to program directors as a formative evaluation instrument to assess residents’ progress.

Research Graph 101

https://doi.org/10.59350/d9k0q-57w08
Published March 1, 2024 in Stories by Research Graph on Medium
Research Graph
Author: Amir Aryani, https://orcid.org/0000-0002-4259-9774 In this article we look at Research Graph as an information model , and an approach to connect and capture the connections between research outputs, researchers and research activities. We explore the metadata model, and we discuss how to capture this graph in a Neo4j Graph Database.

Including ORCID identifiers in BibLaTeX (and using them)

https://doi.org/10.59350/bk8yd-b1307
Published March 2, 2024 in Syntaxus baccata
Lars Willighagen
On the Fediverse, @petrichor@digipres.club posited the question how to include identifiers for authors in Bib(La)TeX-based bibliographies: I have wanted to try including ORCIDs in bibliographies for a while now, and while CSL-JSON makes it nearly trivial to encode, neither CSL styles nor CSL processors are at the point where those can actually be inserted in the formatted bibliography.

Avoiding the word footgun(s)

https://doi.org/10.59350/xvy60-m1j11
Published March 3, 2024 in recology
Scott Chamberlain
I recently opened an issue in a repository for a package I’m working on to think about potential footguns and how to avoid them. That word “footguns” got me thinking;

rOpenSci Code of Conduct Annual Review

https://doi.org/10.59350/k0q1e-gzb74
Published March 4, 2024 in rOpenSci - open tools for open science
Yanina Bellini Saibene, Mark Padgham, Kara Woo, Natalia Morandeira
Our community is our best asset. It’s so important to us, it’s in our mission statement. We recognize that communities are not inclusive by default; they require deliberate attention, including an enforceable Code of Conduct. rOpenSci is committed to providing a safe, inclusive, welcoming, and harassment-free experience for everyone.

Shiny button weirdness

https://doi.org/10.59350/phhx2-n3474
Published March 4, 2024 in recology
Scott Chamberlain
I’ve been working on an inherited Shiny app at work for the past few months. One of the many frustrating things about Shiny lately has been around buttons. Well, it wasn’t really about buttons, but that’s where it started. I noticed that a number of buttons - some having file inputs, some having text inputs - did not trigger every time, and I expected them to trigger every time no matter what.

Harnessing The Power of Knowledge Graphs in Question Answering

https://doi.org/10.59350/2r9ae-ayf81
Published March 4, 2024 in Stories by Research Graph on Medium
Research Graph
Unlocking the Power of Questions — A deep dive into Question Answering Systems Author: Amanda Kau (ORCID: 0009–0004–4949–9284 ) Virtual assistants have popped up on numerous websites over the years.

How to use GPT-4V For Stoplight Report

https://doi.org/10.59350/j1hby-pt034
Published March 4, 2024 in Stories by Research Graph on Medium
Research Graph
Efficient creation of a stoplight report with data dashboard images Author: Yunzhong Zhang (ORCID: 0009–0002–8177–419X) Comparing data dashboards is crucial for understanding trends and performance differences. Traditionally, this task required manual effort, which was slow and sometimes inaccurate. Now, thanks to OpenAI’s GPT-4 with Vision (GPT-4V), we are able to automate and improve this process.

Enhancing Open-Domain Conversational Question Answering with Knowledge-Enhanced Models and…

https://doi.org/10.59350/2zfk4-m9n70
Published March 4, 2024 in Stories by Research Graph on Medium
Research Graph
Enhancing Open-Domain Conversational Question Answering with Knowledge-Enhanced Models and Knowledge Graphs How knowledge-enhanced language models and knowledge graphs are advancing open-domain conversational question answering Author: Wenyi Pi (ORCID: 0009-0002-2884-2771 ) When searching for information on the website, it is

Knowledge Graphs: Deep Dive into OpenAlex

https://doi.org/10.59350/g3zse-fdh02
Published March 5, 2024 in Stories by Research Graph on Medium
Research Graph
Unlocking the power of knowledge graphs in research catalogues: A deep dive into OpenAlex Author: Dhruv Gupta (ORCID: 0009–0004–7109–5403 ) Clive Humby, in 2006 rightly said, “Data is the new oil”. With data being present everywhere, it has never been more valuable.

Deeper understanding of In-Context Retrieval-Augmented Language Models

https://doi.org/10.59350/ch3em-a4h27
Published March 5, 2024 in Stories by Research Graph on Medium
Research Graph
Transformative Advances in Language Models through External Knowledge Integration Author: Qingqin Fang ( ORCID: 0009–0003–5348–4264) Introduction In the dynamic field of natural language processing, the integration of external knowledge has emerged as a pivotal strategy for enhancing the performance of language models.

Question Answering enhanced by Knowledge Graph

https://doi.org/10.59350/tfrff-c2c20
Published March 5, 2024 in Stories by Research Graph on Medium
Research Graph
Latest findings for the use of Knowledge Graph in the field of QA in multiple research directions Author: Xuzeng He ( ORCID: 0009–0005–7317–7426) Question Answering (QA), the ability to interact with data using natural language questions and obtaining accurate results, has been a long-standing challenge in computer science dating back to the 1960s.

Enhancing Language Models: The Role of Knowledge Graph Augmentation in Overcoming LLM Challenges

https://doi.org/10.59350/jp8jc-31729
Published March 5, 2024 in Stories by Research Graph on Medium
Research Graph
Author: Vaibhav Khobragade ( ORCID: 0009–0009–8807–5982) Introduction Large language models (LLMs) are becoming increasingly popular in natural language processing for their superior competence in various applications.

Citizen Science in the CWTS Knowledge Agenda

https://doi.org/10.59350/z2mgz-fq313
Published March 5, 2024 in Leiden Madtrics
Margaret Gold
Throughout a recent series of blog posts, we have been introducing the CWTS knowledge agenda for 2023-2028, which is divided into three new focal areas to organise our activities on specific themes.

Rogue Scholar has added author affiliation metadata

https://doi.org/10.53731/bxk5g-pxw97
Published March 5, 2024 in Front Matter
Martin Fenner
This week the Rogue Scholar science blog archive has added author affiliation name and ROR ID to the DOI metadata of about 2,500 of its 15K blog posts registered with Crossref. This makes Front Matter (the Crossref member registering Rogue Scholar DOIs) one of the top 10 ROR early adopters in Crossref metadata.

Citation.js: BibLaTeX Data Annotations support

https://doi.org/10.59350/8zzmb-mq510
Published March 5, 2024 in Syntaxus baccata
Lars Willighagen
Version 0.7.9 of Citation.js comes with a new feature: plugin-bibtex now supports the import and export of Data Annotations in BibLaTeX files. This means ORCID identifiers from DOI, Wikidata, CFF, and other sources can now be exported to BibLaTeX. Combined with a BibLaTeX style that displays ORCID identifiers, you can now quickly improve your reference lists with ORCIDs.

“Resolving empirical controversies with mechanistic evidence” – Some thoughts

https://doi.org/10.59350/ph9rk-dc120
Published March 5, 2024 in Politics, Science, Political Science
Ingo Rohlfing
Recently, I came across the open access article Resolving empirical controversies with mechanistic evidence. I have some thoughts on the arguments made in the article, but first a disclaimer: This is not meant to be a disciplinary beauty contest. The article raises a valuable point: Two or more quantitative models may produce contradictory results.

Citizen Scientists can see the wood for the trees

https://doi.org/10.59350/tgeg1-tv760
Published March 5, 2024 in GigaBlog
Hans Zauner
Few countries have a biodiversity that’s comparable to Brazil’s, including plant-based food sources that are little known elsewhere. Even in big cities you may pick all kinds of fruit directly from the tree, which can be both tasty and also interesting for the botanist. The Citizen Science project “Pomar Urbano” collects data on urban fruit-bearing plant in Brazilian cities.

Designing book covers in half an hour, redux

https://doi.org/10.59350/8nhy7-7e692
Published March 6, 2024 in Sauropod Vertebra Picture of the Week
Mike Taylor
Long-time readers may remember that back in 2013, Matt and I played a game where we each designed a cover, in half an hour, for a book whose name was randomly generated. Here’s what I came up with for The Name of the Names : I really enjoyed that process and even toyed with the idea of offering it as a service for hire, for people creating self-published books.

Newsletter

Thank you!