Attacking freetext nightmares with aRmed llamas

R and Ollama for medical text data quality and term extraction

Last updated on Jun 22, 2025 26 min read

A Free Text Horror Story

If you have talked to anyone who works with data, you have probably also heard them complain at length about issues related to data quality. Great efforts are taken to improve data “cleanliness”, without which, analysis will be flawed and confidence will be weak.

One common type of data element is a multiple choice option. Typically these are predefined categories, which enforces standard formatting (yay!). However, many have a final option for “Other, describe”. This allows a data entry clerk to provide an open ended free text description, up to a certain word limit. A free text field, being “free”, has few (if any) rules to control formatting and standardization. As a result, it can often become a dumping ground. If someone cannot find a decent choice in the drop down provided, “Other” may be selected and slight deviations from the categories are provided. This can lead to a mess of inputs that make the “Other” field unreliable. It could include transcription errors, typos, overlaps with predefined fields, or defining unknowns in various ways (NA, Unknown, Unsure, etc).

In public health data systems, free text is a common challenge. There is a whole wealth of research just to parse clinical notes and drug prescription instructions. Many medical forms have inputs with multiple choice options in addition to a “catch all” category for “Other”. For example, there may be several predefined treatment categories but an “Other” option is provided in case there is some new combination that is not listed. Ideally, the form would be improved to better organize the inputs and unknown combinations but that is not the messy data world we live in today.

Enter, Large Language Models (LLMs)

The free text problem is not new and there has been decades of work dedicated to this issue with many options available to address the challenges. Some of these include or are combinations of rule-based solutions (with dictionary searches), fuzzy matching, extensive use of regular expressions, and natural language processing algorithms. However, we are going to see how quickly we can get up and running with a basic use of one of the latest approaches: transformer based LLMs.

We want to use LLMs to perform a common activity in text analysis: Named Entity Recognition (NER). We will provide a set of drug treatment notes and provide instructions for how to identify and extract specific information.

Objective and expectations

Before we dive in, let’s lay out what we want to achieve in this rather introductory example.

Ultimately, we want to use a specific topic area, in this case health data, and analyze text information with minimal effort. As such, we do not anticipate an incredibly robust solution but rather a starting place for understanding. We also, given the domain of health data, want to use a local environment to reduce concerns for data exfiltration. To make it easy to implement we want the solution to be API based so that it is rather coding language agnostic, though we will use R in this worked example. As this is exploring a test case, we want an option that has minimal setup and cost.

These criteria point to using Ollama and an open source LLM model such as Meta’s Llama. Although there are alternatives abound, many of which may be better suited (such as spaCy and tailored models on Hugging face), using Ollama was incredibly straight forward in R and did not require knowledge of torch. In addition, we are not setting out here to solve recognized challenges problems with LLMs, such as scaling, result consistency, hallucinations, or a number of other well documented concerns. To some this solution may seem overly simplistic, to others complete overkill, but given the low barrier to entry it shows how to get started with LLMs for text analysis activities, such as Named Entity Recognition.

Setup

Ollama

To begin, we need to have an installation of Ollama; this is a rather simple process and basic instructions for all major operating systems are provided on their website: https://ollama.com/download

R packages

To use Ollama from R, we require a few packages, most notably {ollamar}.

library(ollamar)
library(glue)
library(httr2)
library(data.table)
library(purrr)

Drug treatment dataset

We will create a dataset based upon suggested drug treatments for several sexually transmitted infections. Although this may sound simple there are numerous complexities. Treatment varies by disease, site of infection, and risk groups (e.g. pregnant, MSM, etc). Furthermore, short hand may be used to describe treatment delivery and dosage such as BID (twice a day), TID (three times a day), PO (oral route), IV (intravenous), and IM (intramuscular). Consequently, there are many different combinations of treatment and how they are administered. This is further complicated by free text fields, allowing custom treatments to be entered, introducing new abbreviations, as well as data quality problems.

data_trmt <- create_trmt_data()
rmarkdown::paged_table(data_trmt[,"text", drop = FALSE], options = list(rownames.print = FALSE))

ABCDEFGHIJ0123456789

text <chr>
doxycycline 100 mg PO BID for 7 days (A-II)
azithromycin 1 g PO as a single dose (A-I; AII for eye)
ceftriaxone 250 mg IM as a single dose (A-I) PLUS azithromycin 1 g PO as a single dose (BII), (B-III for pharyngeal infections)
ceftriaxone 250 mg IM as a single dose (A-I) PLUS azithromycin 1 g PO as a single dose (BII)
Trmt given OOP
Due to resistance to azithromycin, was not provided as monothearpy, was provided in addition to gentamicin
Long-acting benzathine penicillin G 2.4 mu (Bicillin L-A) IM weekly for 2 doses (C-III)
unable to recall
cipro 250mg initially
acyclovir 400 mg PO TID initiated at 36 weeks until parturition (A-I)

Saying hello to a Llama

With this dataset, we want to summarise the treatment details and extract specific information about drugs used as well as their dosage and frequency. First, we grab an open source model: llama3.2:latest. At time of writing this is a (relatively) new model that is neither too big nor too small and, in its description, is suitable for instruction following and summarization.

pull('llama3.2')

Now we make sure the model is listed and we can connect to it.

list_models()
test_connection()

##              name size parameter_size quantization_level            modified
## 1 llama3.2:latest 2 GB           3.2B             Q4_K_M 2025-06-22T16:32:59

One feature (or challenge) with LLMs, is they can be rather “creative”. For consistency in our results, we will tweak the model to be less creative and more deterministic.

search_options('temperature')

## Matching options: temperature

## $temperature
## $temperature$description
## [1] "The temperature of the model. Increasing the temperature will make the model answer more creatively."
## 
## $temperature$default_value
## [1] 0.8

llm_temp <- 0.2

Summarise with Ollama

Our first objective is to ask for a summarization of the drug treatment information column, which contains standard treatments as well as free text inputs. To achieve this, we first create a message with the desired instructions (which is easy with the help of glue()). This is passed to llama3.2 and we print the results.

msg_summarise <- create_message(
  glue(
    'You are an expert in drug treatments in medicine, summarise the drugs used in the following clinical notes as accurately as possible in under 100 words: 
    {paste(data_trmt$text, collapse = "\n")}'
    )
  )

chat('llama3.2', message = msg_summarise, output = 'text', temperature = llm_temp) |>
  cat()

## Based on the clinical notes, the following drugs were used:
## 
## 1. Antibiotics:
##    - Doxycycline: 100 mg PO BID for 7 days (A-II), 100 mg PO BID for 28 days (B-II), and 250mg + cefixime 800mg
##    - Azithromycin: 1 g PO as a single dose (A-I), 2 g PO as a single dose (A-I), 1 g PO as a single dose (B-II), 2 g PO as a single dose (AI), and 1 g PO as a single dose (B-II)
##    - Ceftriaxone: 250 mg IM as a single dose (A-I), 250 mg IM as a single dose (A-I), 2 g IV daily for 10-14 days (B-II), and 2 g IV/IM as a single dose (A-II)
##    - Cefixime: 800 mg PO as a single dose (A-I), 800 mg PO as a single dose (A-I), and 400 mg PO qd x 7 days
##    - Levofloxacin: unknown dose
##    - Moxifloxacin: 400 mg PO daily
##    - Gentamicin: 240 mg IM in 2 separate 3-mL injections of 40 mg/mL solution (B-II)
##    - Penicillin G: benzathine penicillin G 2.4 mu (Bicillin L-A) IM weekly for 2 doses, and crystalline penicillin G 4 mu IV q4h for 10-14 days
## 
## 2. Antivirals:
##    - Acyclovir: 400 mg PO TID for 7-10 days (A-I), 400 mg PO BID (A-I), 800 mg PO TID x 2 days, and 500 mg PO BID or 1 g PO QD
##    - Valacyclovir: 500 mg PO BID for 3 days (B-I), 500 mg PO QD for 10 days (A-I), and 1 g PO QD for 3 days (B-I)
##    - Famciclovir: 125 mg PO BID for 5 days (B-I)

This seems to do a half decent job of summarizing the content but it does vary between runs. Furthermore, if you look closely, with such a basic prompt, it may not organize items in an ideal or consistent way.

Exploratory examples

Let’s continue with our journey and see how the LLM performs with Named Entity Recognition. We start with another basic prompt; this is often referred to as a zero shot prompt as no prior examples of the task are being given from the user.

sys_prompt <- create_message(
  "You are an expert in Named Entity Recognition specializing in extracing drug information for medical treatments.",
  role = 'system'
  )

Before we provide it our entire data set, we will try with one sample: “azithromycin 10mg per day plus doxycyline”

prompt <- append_message(
  "Extract the medicinal drug from the following text: 'azithromycin 10mg per day plus doxycyline'",
  role = 'user',
  sys_prompt
  )

chat('llama3.2', prompt, output = 'text', temperature = llm_temp) |> 
  cat()

## The medicinal drugs extracted from the given text are:
## 
## 1. Azithromycin
## 2. Doxycycline

On this basic example, llama3.2 performed both accurately and consistently. However, the results returned are not in a format that is conducive for further analysis. In other more complex examples, the results may be undesirable and verbose. You could get a response like:

“Based on the provided text, it is challenging to extract specific medicinal drugs without additional context or information. However, I can suggest some possible approaches..

To receive a more workable output, we need to tell the model to return information in a structured format. This is easy to achieve with the format parameter, which takes an object that follows a JSON style structure. Below we define a required character field for the drug name.

format_basic <- list(
  type = "object",
  required = list("DRUG"),
  properties = list(
    DRUG = list(type = "string") # Drug field and data type to return
  )
)

chat('llama3.2', prompt, output = 'structured', format = format_basic, temperature = llm_temp)

## $DRUG
## [1] "Azithromycin and Doxycycline"

We can probably do better than this format. For subsequent analysis in R we probably want the returned values in an array structure to separate each drug.

format_array <- list(
  type = "object",
  required = list("DRUG"),
  properties = list(
    DRUG = list(
      type = "array",
      items = list(type = "string")
      )
    )
  )

chat('llama3.2', prompt, output = 'structured', format =  format_array, temperature = llm_temp)

## $DRUG
## [1] "Azithromycin" "Doxycycline"

It is clever enough to return nothing if uncertain and if no valid drug names are provided.

prompt <- append_message(
  "Extract the medicinal drug from the following text, return nothing if uncertain: 
  'Was given 3 pills'",
  role = 'user', sys_prompt
  )
chat('llama3.2', prompt, output = 'structured', format = format_array, temperature = llm_temp)

## $DRUG
## [1] "pill"

However, this may be inconsistent and highly dependent upon the prompt so some trial and error is needed. In some cases it will return an unwanted value instead of NA. For example, some prompts may return one of the two results:

## $DRUG
## list()

## $DRUG
## [1] "pills"

Extract details

Building upon our basic NER example, we will now provide more context via few shot prompting. In addition, we will raise the stakes by requesting extraction of more information: DOSE and FREQ.

Structured format

To obtain a desired output from the prompt, we define a slightly more detailed format, with properties for the drug name (DRUG), dosage (DOSE) and frequency of administration (FREQ).

format_detailed <- list(
  type = "object",
  required = list('ENTRY'),
  properties = list(
    ENTRY = list(
      type = "array",
      items = list(
        type = 'object',
        required = list('DRUG', 'DOSE', 'FREQ'),
        properties = list(
          DRUG = list(type = list("string", "null")), 
          DOSE = list(type = list("string", "null")),
          FREQ = list(type = list("string", "null"))
          )
        )
      )
    )
  )

New system message

With few shot prompting, we provide additional context and examples to orientate the prompt responses. First, we tell llama3.2 that is must perform text extraction for specific entities. Second, we provide some rules based upon known complications in the data. For example, there are many niche medical terms used which need to be understood for dosage. Lastly, we provide a few of the more difficult examples and how they should be parsed.

sys_prompt_fewshot <-  create_message(
  role = "system",
  'You are now an expert in text extraction of medical drug treatment information from clinical notes. There are three types of entities to extract: (1) names of drugs used for medical treatment (DRUG), (2) dose of the drug (DOSE), and (3) the frequency the drug is given for that dose (FREQ).
  
  It is important to remember the following when extracting entities:
  (1) There may be multiple drugs listed in the text.
  (2) Drugs may be abbreviated or misspelled and should be corrected if there is a confident replacement. If there is no confident replacement, return ["NA"]
  (3) Generic terms for drugs such as "drug" or "pills" should be replace with ["NA"].
  (4) Not all drugs identified will have a DOSE or FREQ, if DOSE and FREQ are missing or unknown their values should be ["NA"].
  (5) Only identify DRUG, DOSE, or FREQ information when there is a confident answer, do not guess. If there is no information (missing, uncertain, or unknown) return the value ["NA"] for each entity.
  (6) The DRUG, DOSE, and FREQ should always be reported as a complete set. If a DRUG is missing, the DOSE and FREQ should also be missing as ["NA"]. If the drug is unknown, both DOSE and FREQ should return ["NA"]. 
  (7) Do not extract abbrevations related to the route the drug is administered: such as "PO" for oral administration, "IV" for intravenous, and "IM" intramuscular.
  (8) Be aware of the following abbreviations for FREQ:  "qh" for "every hour", "q4h" for "every four hours", "QD" for "once daily", "BID" for "twice daily", "TID" for "thrice daily".
  
  Input Example 1: "The doctor provided both clindamycin 5ug PO per day for 7 days, and amoxacillin one dosage (A-II)."
  Output 1: {
  ENTRY: [
    {DRUG: "clindamycin", DOSE: "5ug", FREQ: "qd 7 days"},
    {DRUG: "amoxacillin", DOSE: "one", FREQ: "NA"}
    ]
  }
  
  Input Example 2: "No treatment was provided, not even amoxacillin."
  Output 2: {ENTRY: [{DRUG: "NA", DOSE: "NA", FREQ: "NA"}]}
                 
  Input Example 3: "amox and pen g was given for BID for 5 days but not clindamycin."
  Output 3: {
  ENTRY: [
    {DRUG: "amoxacillin", DOSE: "NA", FREQ: "BID for 5 days"},
    {DRUG: "penicillin g", DOSE: "NA", FREQ: "BID for 5 days"}
    ]
  }
                 
  Input Example 4: "Possibly given treatment but unsure, maybe it was cefixime."
  Output 4: {ENTRY: [{DRUG: "NA", DOSE: "NA", FREQ: "NA"}]}
  
  Input Example 5: "Refused treatment to 5 days BID cipro?"
  Output 5: {ENTRY: [{DRUG: "NA", DOSE: "NA", FREQ: "NA"}]}
                 
  Given the text below, extract the entities for drug (DRUG), dose (DOSE), and frequency (FREQ) and return the result in JSON format. Standardize the output to be in lower case, no special characters, trimmed white space, and spelled correctly.')

Apply to text

With the context provided, we now perform the numerous requests for each of the treatment entries. With {glue}, {ollamar}, and {httr2}, this is very easy to do!

# Basic func (lazy and leaky)
create_chat_req <- function(text) {
  prompt <- append_message(
    role = 'user',
    content = glue('Here is the text to perform the extraction: {text}'),
    sys_prompt_fewshot
    )
  chat('llama3.2', prompt, output = 'req', format = format_detailed, temperature = llm_temp)
}

reqs <- lapply(data_trmt$text, create_chat_req)

Now send each request in parallel to speed things up and extract in the structured format.

resps <- req_perform_parallel(reqs)
struc_resps <- lapply(resps, resp_process, 'structured')

Knowing that LLMs may return unexpected results, we add a check on returned values. In this instance, we want every entry to have 3 values (drug, dose, freq) and in situations where nothing is returned the result should have NA/Missing values. For example, this may be a common entry if nothing of interest is found.

## [[1]]
## [[1]]$ENTRY
## list()

But this is actually what we want returned…

## [[1]]
## [[1]]$ENTRY
##   DRUG DOSE FREQ
## 1   NA   NA   NA

When a result is not in the expected format we can request a few more attempts before it errors out.

malform_idx <- vector('numeric')
malform_idx <- which(sapply(struc_resps, \(x) length(x$ENTRY) == 0))
if(length(malform_idx) > 0 ) {
  req_attempt <- req_perform_parallel(reqs[malform_idx])
  req_attempt_struct <- lapply(req_attempt, resp_process, 'structured')
  fixed_idx <- which(sapply(req_attempt_struct, \(x) length(x$ENTRY) > 0))
  struc_resps[malform_idx] <- req_attempt_struct[fixed_idx]
}

For easier analysis we collapse the list of returned results into a data.frame.

# Calculate number of entries 
length_entry <- sapply(struc_resps, \(x) length(pluck(x, 'ENTRY', 1)))

data_trmt_processed <- purrr::list_flatten(struc_resps) |> 
  purrr::list_rbind()

Review results and conclusions

With the final results combined, we can now compare them to the raw text inputs. We achieve this with a merge on the row_id. Where multiple drug entries were found, the row_id is duplicated.

data_trmt_processed$row_id <- rep(seq(1:length(struc_resps)), length_entry)
data_trmt_processed <- merge(data_trmt_processed, data_trmt, by = 'row_id', all.x = TRUE, all.y = FALSE)
knitr::kable(data_trmt_processed) |>
  kableExtra::kable_styling(bootstrap_options = c("hover", "condensed")) |>
  kableExtra::scroll_box(height = "500px")

row_id	DRUG	DOSE	FREQ	text
1	doxycycline	100mg	BID for 7 days	doxycycline 100 mg PO BID for 7 days (A-II)
2	azithromycin	1g	single dose	azithromycin 1 g PO as a single dose (A-I; AII for eye)
3	ceftriaxone	250 mg	single dose	ceftriaxone 250 mg IM as a single dose (A-I) PLUS azithromycin 1 g PO as a single dose (BII), (B-III for pharyngeal infections)
3	azithromycin	1 g	PO as a single dose (B-III for pharyngeal infections)	ceftriaxone 250 mg IM as a single dose (A-I) PLUS azithromycin 1 g PO as a single dose (BII), (B-III for pharyngeal infections)
4	ceftriaxone	250mg	single	ceftriaxone 250 mg IM as a single dose (A-I) PLUS azithromycin 1 g PO as a single dose (BII)
4	azithromycin	1g	single	ceftriaxone 250 mg IM as a single dose (A-I) PLUS azithromycin 1 g PO as a single dose (BII)
5	NA	NA	NA	Trmt given OOP
6	azithromycin	NA	NA	Due to resistance to azithromycin, was not provided as monothearpy, was provided in addition to gentamicin
6	gentamicin	NA	NA	Due to resistance to azithromycin, was not provided as monothearpy, was provided in addition to gentamicin
7	benzathine penicillin g	2.4 mu	weekly	Long-acting benzathine penicillin G 2.4 mu (Bicillin L-A) IM weekly for 2 doses (C-III)
8	NA	NA	NA	unable to recall
9	cipro	250mg	initially	cipro 250mg initially
10	acyclovir	400mg	tid	acyclovir 400 mg PO TID initiated at 36 weeks until parturition (A-I)
11	doxycycline	100.000	NA	drugs: doxycycline dose: 100.000 dose unit
12	acyclovir	400mg	tid for 7-10 days	acyclovir 400 mg PO TID for 7-10 days (A-III)
13	azithromycin	2g	single	Due to pregnancy was not provided azithromycin* 2 g PO as a single dose, was provided cefixime 800 mg PO as a single dose plus azithromycin 1g
13	cefixime	800mg	single	Due to pregnancy was not provided azithromycin* 2 g PO as a single dose, was provided cefixime 800 mg PO as a single dose plus azithromycin 1g
14	cefixime	800mg	single	cefixime 800 mg PO as a single dose (A-I) PLUS azithromycin 1g PO as a single dose (BII)
14	azithromycin	1g	single	cefixime 800 mg PO as a single dose (A-I) PLUS azithromycin 1g PO as a single dose (BII)
15	azithromycin	2g	single dose	azithromycin* 2 g PO as a single dose (AI) PLUS gentamicin 240 mg IM in 2 separate 3-mL injections of 40 mg/mL solution (B-II)
15	gentamicin	240mg	separate 3-mL injections	azithromycin* 2 g PO as a single dose (AI) PLUS gentamicin 240 mg IM in 2 separate 3-mL injections of 40 mg/mL solution (B-II)
16	ceftriaxone	2g	daily	ceftriaxone 2g IV daily for 10-14 days (B-II)
17	NA	NA	NA	declined treatment
18	moxifloxacin	400mg	daily	moxifloxacin 400 mg po daily
19	azithromycin	2g	single	azithromycin* 2 g PO as a single dose (A-I) PLUS gemifloxacin 320 mg PO in a single dose (B-II)
19	gemifloxacin	320mg	single	azithromycin* 2 g PO as a single dose (A-I) PLUS gemifloxacin 320 mg PO in a single dose (B-II)
20	na	na	na	? ceftrix but no record documented of taking it
21	doxycycline	100mg	bid for 28 days	doxycycline 100 mg PO BID for 28 days (B-II; C-III for HIV-infected)
22	azithromycin	2g	single dose	azithromycin* 2 g PO as a single dose (A-I) PLUS gentamicin 240 mg IM in 2 separate 3-mL injections of 40 mg/mL solution (B-II)
22	gentamicin	240mg	separate 3-mL injections	azithromycin* 2 g PO as a single dose (A-I) PLUS gentamicin 240 mg IM in 2 separate 3-mL injections of 40 mg/mL solution (B-II)
23	benzathine penicillin g	2.4 mu	weekly	Long-acting benzathine penicillin G 2.4 mu (Bicillin L-A) IM weekly for 3 consecutive weeks (A-II)
24	cefixime	800mg	single	cefixime 800 mg PO as a single dose (A-l) PLUS azithromycin 1 g PO as a single dose (BII)
24	azithromycin	1g	single	cefixime 800 mg PO as a single dose (A-l) PLUS azithromycin 1 g PO as a single dose (BII)
25	ceftriaxone	800mg	single	cefxime 800 mg single dose PLUS azith 1g as a single dose
25	azithromycin	1g	single	cefxime 800 mg single dose PLUS azith 1g as a single dose
26	valacyclovir	500mg	bid	valacyclovir 500 mg PO BID initiated at 36 weeks until parturition (B-I)
27	NA	NA	NA	Was treated out of province
28	famciclovir	250mg	bid	famciclovir 250 mg PO BID (A-I)
29	benzathine penicillin g	2.4 mu	NA	Doxycycline is not recommended for use during pregnancy, provided benzathine penicillin G 2.4 mu
29	doxycycline	NA	NA	Doxycycline is not recommended for use during pregnancy, provided benzathine penicillin G 2.4 mu
30	NA	NA	NA	treatment received
31	amoxicillin	875mg	qd 7 days	amoxi clave 875 mg for 7 days
32	doxycycline	100mg	BID for 7 days	doxycycline 100 mg PO BID for 7 days (A-I; A-II for eye)
33	azithromycin	1g	single	azithromycin 1 g PO as a single dose (A-I)
34	ceftriaxone	250mg	single	ceftriaxone 250 mg IM as a single dose (A-I) PLUS azithromycin 1 g PO as a single dose (BII)
34	azithromycin	1g	single	ceftriaxone 250 mg IM as a single dose (A-I) PLUS azithromycin 1 g PO as a single dose (BII)
35	doxycycline	100mg	BID for 7 days	doxycycline 100 mg PO BID for 7 days (A-I)
36	amoxicillin	500mg	tid for 7 days	amoxicillin 500 mg PO TID for 7 days (A-I)
37	doxycycline	100mg	bid for 14 days	doxycycline 100 mg PO BID for 14 days (B-II; C-III for HIV-infected)
38	azithromycin	1g	single dose	azithromycin* 1 g PO as a single dose (B-I)
39	NA	NA	NA	No treatment was provided
40	doxycycline	250mg	NA	doxycycline 250mg + cefixime 800mg
40	cefixime	800mg	NA	doxycycline 250mg + cefixime 800mg
41	azithromycin	1g	single	azithromycin 1 g PO as a single dose (A-II)
42	levofloxacin	unknown	NA	levofloxacin dose unknown
43	blue pills	NA	NA	A handfull of blue pills
44	acyclovir	400mg	bid	acyclovir 400 mg PO BID (A-I)
45	ceftriaxone	2g	single	ceftriaxone 2 g IV/IM as a single dose (A-II) PLUS azithromycin 1 g PO as a single dose (BII)
45	azithromycin	1g	single	ceftriaxone 2 g IV/IM as a single dose (A-II) PLUS azithromycin 1 g PO as a single dose (BII)
46	NA	NA	NA	Attempted contact, but could not locate
47	famciclovir	125mg	bid for 5 days	famciclovir 125 mg PO BID for 5 days (B-I)
48	azithromycin	2g	single dose	azithromycin* 2 g PO as a single dose (AI) PLUS gentamicin 240 mg IM in 2 separate 3 mL injections of 40 mg/mL solution (B-II)
48	gentamicin	240mg	separate injections	azithromycin* 2 g PO as a single dose (AI) PLUS gentamicin 240 mg IM in 2 separate 3 mL injections of 40 mg/mL solution (B-II)
49	valacyclovir	1g	qd	valacyclovir 1 g PO QD for 3 days (B-I)
50	moxifloxacin	0.5% 1 drop	qh	moxifloxacin 0.5% 1 drop os every hour x 1 day
51	cefixime	800 mg	single	cefixime 800 mg PO as a single dose (A-I for MSM, B-III for pharyngeal infections) PLUS azithromycin 1 g PO as a single dose (B-II for MSM), (B-III) for pharyngeal infections
51	azithromycin	1 g	single	cefixime 800 mg PO as a single dose (A-I for MSM, B-III for pharyngeal infections) PLUS azithromycin 1 g PO as a single dose (B-II for MSM), (B-III) for pharyngeal infections
52	white pill	NA	NA	Six white and two blue pills
52	blue pill	NA	NA	Six white and two blue pills
53	valacyclovir	500mg	bid for 3 days	valacyclovir 500 mg PO BID for 3 days (B-I)
54	azithromycin	2g	single dose	azithromycin* 2 g PO as a single dose (A-I) PLUS gentamicin 240 mg IM in 2 separate 3-mL injections of 40 mg/mL solution (B-II)
54	gentamicin	240mg	separate 3-mL injections	azithromycin* 2 g PO as a single dose (A-I) PLUS gentamicin 240 mg IM in 2 separate 3-mL injections of 40 mg/mL solution (B-II)
55	NA	NA	NA	Lost to follow up
56	cefixime	400mg	qd	cefixime 400 mg po qd x 7 days
57	penicillin	crystalline penicillin g	q4h	penicillin desensitization followed by crystalline penicillin G 4 mu IV q4h for 10-14 days
57	penicillin	mu	NA	penicillin desensitization followed by crystalline penicillin G 4 mu IV q4h for 10-14 days
58	azithromycin	NA	NA	? azithromycin
59	azithromycin	2g	single	azithromycin* 2 g PO as a single dose (A-I) PLUS gemifloxacin 320 mg PO in a single dose (B-II)
59	gemifloxacin	320mg	single	azithromycin* 2 g PO as a single dose (A-I) PLUS gemifloxacin 320 mg PO in a single dose (B-II)
60	bicillin	NA	NA	refused bicillin treatment
61	cefixime	800mg	single	cefixime 800 mg PO as a single dose (A-I) PLUS azithromycin 1g PO as a single dose (BII)
61	azithromycin	1g	single	cefixime 800 mg PO as a single dose (A-I) PLUS azithromycin 1g PO as a single dose (BII)
62	unknown medication	NA	NA	unknown medication provided
63	crystalline penicillin g	4 mu	q4h	crystalline penicillin G 4 mu IV q4h for 10-14 days (A-II)
64	valacyclovir	500mg, 1g	BID, QD	valacyclovir 500 mg PO BID or 1 g PO QD (AI) [for patients with > 9 recurrences per year]
65	benzathine penicillin g	2.4 mu	single	Long-acting benzathine penicillin G 2.4 mu (Bicillin L-A) IM as a single dose (A-II;C-II for HIV-infected)
66	azithromycin	2g	single	azithromycin* 2 g PO as a single dose (A-I) PLUS gemifloxacin 320 mg PO in a single dose (BII)
66	gemifloxacin	320mg	single	azithromycin* 2 g PO as a single dose (A-I) PLUS gemifloxacin 320 mg PO in a single dose (BII)
67	valacyclovir	500mg	qd	valacyclovir 500 mg PO QD (A-I) [for patients with < 9 recurrences per year]
68	azithromycin	2g	single	azithromycin* 2 g PO as a single dose (A-I) PLUS gemifloxacin 320 mg PO in a single dose (B-II)
68	gemifloxacin	320mg	single	azithromycin* 2 g PO as a single dose (A-I) PLUS gemifloxacin 320 mg PO in a single dose (B-II)
69	acyclovir	800mg	tid	acyclovir 800 mg PO TID x 2 days
70	valacyclovir	1g	BID for 10 days	valacyclovir 1 g PO BID for 10 days (A-I)
71	famciclovir	250mg	tid for 5 days	famciclovir 250 mg PO TID for 5 days (A-I)
72	cipro	500	bid	cipro 500 bid x 1 wk
73	NA	NA	NA	cream given
74	NA	NA	NA	See notes
75	peng	2.4 mu	NA	peng 2.4 mu

How well did the LLM perform compared to our expectations and defined rules?

First the good:

It was able to understand negation successfully
It usually ignored treatments when there was uncertainty in the phrasing
In most cases it could extract the multiple possibilities and assign the closest related dosage

… now the less good:

Missing values were returned in varying formats
Several different formats were used for identical drugs, often due to confusion over abbreviations
Low confidence in results of FREQ data due to inconsistencies
Returned missing where a valid drug should have been found :(

Overall, considering the minimal effort and model used, the results performed better than expected. With a model that is more aware of medical terms, I would expect better performance. Given that free text fields are inherently known for data quality issues, I can foresee methods such as those presented here to be of great benefit in identifying notable data quality problems in text information for human review.

References

R LLM

Allen O'Brien

Infectious Disease Epidemiologist

I am an epidemiologist with a passion for teaching and all things data.