Inherent limitations of the tests we currently have make it unlikely that applying testing to the entire population is going to be very useful.
The elite-level consensus has been for some time now that the solution to all things COVID-19 is testing. The fancy group of talking heads who make the rounds daily on cable news certainly seems convinced more testing would have avoided the scale of the current crisis and that more testing will allow a safe exit from the current lockdown that has 30 million Americans unemployed.
Surprisingly little attention seems to have been paid to some practical matters related to trying to test tens of millions of Americans. Many are laser focused on creating an infrastructure to do testing, but have spent little time on discussing obstacles that relate to the current capability of the available tests.
We Have Two Kinds of Tests
At the moment there are two types of tests: real-time polymerase chain reaction (RT-PCR) and serologic testing. RT-PCR detects the virus’s genetic code, while serologic testing detects antibodies the body makes in response to the virus. Both forms of testing have limitations that are important to consider when trying to evaluate the accuracy of tests.
Diagnostic tests have traditionally been evaluated based on sensitivity and specificity. Sensitivity refers to the probability a test will return a positive result in someone who has the disease being tested for, while specificity refers to the probability a test will return a negative result in someone who does not have the disease being tested for.
A poorly sensitive test will miss a lot of people who actually have a disease, while a poorly specific test will say a lot of patients have a disease when they actually don’t. So a very accurate test should have a high degree of sensitivity as well as specificity.
Yet even a test that appears to be very good by these metrics may end up being almost useless because test performance is highly dependent on the underlying suspicion of disease. This is mostly why doctors don’t recommend getting tested for everything just because you can. Even cancer screenings are targeted to try to ensure the screened population is at higher risk of disease because there may be more harm than good if 20-year-olds were to start getting mammograms en masse.
Understanding why requires understanding that sensitivity and specificity, while important, are less important to patients than predictive values. Patients want to know the probability they have a disease if their test is positive or negative. Sensitivity and specificity are simple probabilities that tell you how the test performs in people whose disease status is already known. Converting these numbers to predictive values requires knowing the underlying prevalence of the disease in the population being tested.
Let’s Illustrate With an Example
As an example, let’s imagine an island of 1,000 individuals with a prevalence of lupus of 10 percent. This means 100 patients have lupus on the island (0.1 x 1,000). If we use the sensitivity and specificity from above (90 percent sensitive, 95 percent specific), 90 of the 100 patients are true positives, and 855 (0.95 x 900) patients are true negatives. This leaves 10 false negatives, and 45 false positives.
This means that this particular test will have 135 positive tests. But only 90 will be truly positive, the remaining 45 are false positives, which gives a 66 percent chance that a positive test means actually having the disease. Uninspiring probabilities, to say the least.
The situation can be improved markedly by greatly improving how specific a test is to reduce the number of false positives, but even a test that is 99 percent specific requires a prevalence of disease more than 10 percent for the test result to be meaningful.
The problem for coronavirus testing is that it is likely circulating in most places in the United States in very small single-digit numbers, except for a small number of ZIP codes located in a few hotspots in the country like certain counties in New York, New Jersey, and Michigan.
Why The Other Kind of Testing Also Has Problems
Our current testing does include RT-PCR, which is nearly 100 percent specific. This means everyone who tests positive with this test has the virus, but unfortunately the test has been found to have a high rate of false negatives, so a negative test doesn’t rule out disease.
PCR also only speaks to disease status at the time of the test. It’s possible a PCR-negative patients acquires the virus the following day. Complicating matters even further is that we may shed viral RNA that PCR can detect for up to six weeks after an infection, although it is definitely not the case that this means people are infectious for six weeks.
Source: JAMA.
Antibody tests to date have not been robustly validated in part because this is hard to do. Finding true positive and negative controls is challenging. Antibody tests are also negative early in the course of disease because the body has not had time to produce enough detectable antibody, and may be falsely positive if they crossreact with other commonly circulating coronaviruses that are known to cause the common cold.
So inherent limitations of the tests we currently have make it unlikely that applying testing to the entire population is going to be very useful. Testing for high-risk patients like health-care workers and symptomatic patients is important, not because it will be perfect for the individual patients, but because it can track outbreaks of disease in the population.
This Means Test-Based Contact Tracing Won’t Work Well
These considerations make it even harder to understand how test-based contact tracing and isolation will be effective. If we test too broad of a population, we are likely to make life unnecessarily difficult for a vast number of Americans who would face the risk of isolation based on an inaccurate test, and spend vast public health resources doing contact tracing for no good reason.
Even if the tests performed much better than they do currently, the architecture that is being discussed to do contact tracing has major unresolved issues. The cool-sounding idea is to use smartphones to track who citizens come into contact with. Theoretically, once a patient tests positive, a list of other phones and citizens the patient came into contact with over the prior 48 hours could easily be tracked.
Practically speaking, this would require a large proportion of patients (upward of 60 percent) in a population to download and use the app, and there would be a high probability of false positives generated by phones that ping each other through walls or ping the runner jogging by you in the park six feet away. It’s also notable that Singapore, a very small country with a population of 5 million, is in the midst of a significant outbreak that forced a widespread lockdown despite its widely praised app-based contact tracing.
Testing Isn’t a Panacea at All
Much of the enthusiasm to test more comes from those that link countries with high testing rates to low death rates. In doing so, proponents conveniently ignore that the various countries also differ in size, density, geographic location, population demography, connectedness, and travel/social distancing policies.
Also ignored is the amount of testing currently taking place in the United States. New York is testing at a rate five times higher than South Korea. At the moment, testing capacity in most of the country actually exceeds the demandfor testing.
Testing is vital to tracking outbreaks in communities and informing policy locally and nationally, but it is profoundly unclear that reaching some magic higher threshold of testing is the answer to our problems.