AI & Machine Learning Blog

October 12, 2022

AI for Pathology at Unprecedented Scale

Views on artificial intelligence (AI) today tend to fall into two camps: those who lean fully into its hype, claiming that AI is more profound than fire, and the cynics, who see it as a biased black box that can never compare to the abilities of the human brain. The truth is somewhere in the middle. In healthcare, AI does hold great promise for helping physicians and benefiting patients, perhaps on a greater scale than common clinical technologies. Yet to deliver on that promise, it must be consistently tested and improved, and used not as the final word in diagnosis, but as a tool to aid physicians. This will help ensure that AI remains safe, effective, and equitable for patients.

The Promise of AI

Despite the mixed views on the importance of AI, it has undoubtedly become an integral part of our daily lives. There is little that AI hasn’t touched, from banking to Netflix to customer service chatbots. Naturally, AI has also made its way into healthcare, and pathology is the next logical step. Introducing AI into pathology offers many benefits. First, AI can run automatic quality control, ensuring that the digitized slide images to be assessed are scanned clearly, in focus, and free from debris. Once digital cases have been created, AI can help pathologists quickly triage cases by indicating which slides are suspicious for cancer. During review, AI can guide the pathologist to the area on the slide that appears potentially cancerous, or it can offer an instant second opinion when used after a pathologist’s initial read. Finally, AI offers quality assurance of the sign-out process to further enhance pathologist and patient confidence.

Importantly, all of this can be done on hematoxylin and eosin (H&E)-stained slides. As it stands today, H&E is ubiquitous thanks to its low cost and the reproducibility of the pre-analytical process. With AI, we can pull even more information out of H&E-stained slides, including quantifying and grading tumors, and identifying known molecular biomarkers or even novel biomarkers that go beyond what is discernible to the human eye today. This gives pathologists the ability to create more comprehensive reports at the time of diagnosis, reducing the need for additional stains, minimizing delays, and accurately guiding additional testing or treatment.

Challenges and Considerations for Clinical Application

As a result of its prominence in routine clinical use, H&E provides a vast dataset from which to train AI, addressing one of the key challenges that come with building clinical-grade tools. Pathology, as we know, is an empirical science that can’t be learned from a book. Becoming an excellent pathologist requires reviewing tens of thousands of slides that represent the breadth and diversity of biology. Therefore, if AI is to support a pathologist in their diagnosis, it has to be exposed to at least as many slides as a human expert to be equally capable. Any AI trained on only a small, curated dataset, as many early iterations were, will be unfit for the realities of clinical practice. In essence, it would be like training a self-driving car in an empty parking lot when it needs to perform on busy highways: it would be bound to fail.

Another challenge with building pathology AI is the complexity of the machine learning system required. Pre-built, off-the-shelf machine learning models are not an option, because whole-slide images are far too large. Manual annotation will not work either, as it is highly subjective and impossible to do at scale. It also limits AI to learning only what pathologists already know, thereby eliminating the potential of AI to make new discoveries. Instead, multiple instance learning (MIL) provides a scalable, reliable approach.

In MIL, the system is trained on whole-slide images together with their corresponding pathology reports, which tell the algorithm only which slides contain at least one instance of cancer and which do not. The algorithm then compares these images repeatedly, finding the differences between the slides that contain cancer and those that do not. After many iterations of this process, the AI learns what cancer looks like, so that when new slides are analyzed it can detect not only whether cancer is present, but also where it is located on the slide.
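To make the idea concrete, here is a minimal sketch of MIL on synthetic data. It is not Paige's system: the two-feature "tiles", the linear tile scorer (a toy stand-in for a deep network), the noisy-OR pooling rule, and every function name below are assumptions chosen for illustration. Each slide is treated as a bag of tiles, only the slide-level label is used during training, and the slide is scored as positive if at least one tile looks cancerous.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def tile_probs(weights, bias, slide):
    """Per-tile cancer probability from a linear model (toy stand-in for a CNN)."""
    return [sigmoid(sum(w * x for w, x in zip(weights, tile)) + bias) for tile in slide]


def slide_prob(probs):
    """Noisy-OR pooling: P(slide positive) = 1 - P(no tile is positive)."""
    none_positive = 1.0
    for p in probs:
        none_positive *= 1.0 - p
    return 1.0 - none_positive


def train(slides, labels, n_features, epochs=300, lr=0.5):
    """Full-batch gradient descent on slide-level cross-entropy. No tile labels are used."""
    weights, bias = [0.0] * n_features, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * n_features, 0.0
        for slide, label in zip(slides, labels):
            probs = tile_probs(weights, bias, slide)
            p_slide = slide_prob(probs)
            # Gradient of the loss w.r.t. each tile logit is coef * p_tile.
            coef = 1.0 if label == 0 else -(1.0 - p_slide) / max(p_slide, 1e-9)
            for tile, p in zip(slide, probs):
                g = coef * p
                grad_b += g
                for j, x in enumerate(tile):
                    grad_w[j] += g * x
        n = len(slides)
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def make_slide(rng, has_cancer, n_tiles=10):
    """Synthetic slide (hypothetical data): a positive slide hides one cancer tile."""
    slide = [[rng.gauss(0.0, 0.3), rng.gauss(0.0, 0.3)] for _ in range(n_tiles)]
    cancer_idx = None
    if has_cancer:
        cancer_idx = rng.randrange(n_tiles)
        slide[cancer_idx][0] = rng.gauss(3.0, 0.3)  # feature 0 is elevated in cancer
    return slide, cancer_idx


def make_dataset(rng, n_slides):
    """Alternate negative and positive slides; return slides, labels, cancer tile indices."""
    labels = [i % 2 for i in range(n_slides)]
    built = [make_slide(rng, bool(y)) for y in labels]
    return [s for s, _ in built], labels, [c for _, c in built]


if __name__ == "__main__":
    rng = random.Random(0)
    train_slides, train_labels, _ = make_dataset(rng, 40)
    weights, bias = train(train_slides, train_labels, n_features=2)
    slide, cancer_idx = make_slide(rng, has_cancer=True)
    probs = tile_probs(weights, bias, slide)
    print("slide-level probability:", round(slide_prob(probs), 3))
    print("most suspicious tile:", probs.index(max(probs)), "| true cancer tile:", cancer_idx)
```

Although the model never sees which tile is cancerous, the per-tile probabilities it learns can be read back after training to point at the most suspicious region, which is the "where" described above. Real systems replace the linear scorer with a deep network and often use other pooling rules, such as ranking the top-scoring tiles.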

The Paige Approach

This is the exact approach Paige took to training our AI. To train the system from a large and diverse dataset, Paige and Memorial Sloan Kettering digitized over 5 million whole slides representing patients from around the globe. Paige’s AI modules were thoughtfully trained on this data to ensure they can be trusted and work robustly across institutions and diverse patient data, regardless of pre-analytical slide variations. In a foundational study conducted on whole-slide images from over 15,000 patients across the globe, we found that Paige AI was able to detect prostate cancer reliably at an unprecedented clinical-grade level.¹

Of course, we did not stop there. Continuous testing is key to ensuring AI remains safe and effective for real-world use with patients. Our latest study tasked 16 pathologists with the review of 610 whole-slide images prepared at multiple institutions globally. They reviewed the slides once without assistance, and then again with assistance from Paige AI. When Paige AI was used, diagnostic errors were reduced by 70%.²

Paige’s rigorous approach to building and testing our AI ultimately helped us earn breakthrough designation from the FDA, a great leap forward for bringing pathology AI to the fore. Today, our AI for prostate cancer, Paige Prostate Detect, is the only FDA-approved AI-powered digital pathology product. Our hope is that it can help set the stage for the whole field of pathology AI to grow and positively impact patient care.

The Future of AI in Pathology

At the same time, we are further tapping into the potential of AI to teach us something new. Whereas clinical AI makes the existing diagnostic process easier, AI applied to computational biomarkers can help us discover entirely new ways of understanding and treating cancer.

Using H&E-stained slides, which are lower cost, faster, and more reproducible than other commonly used stains for biomarker identification, AI has the potential to identify the presence of known biomarkers as well as discover new biomarkers that are unknown to pathologists today. The benefit of this approach is that it reduces the subjectivity that comes with other biomarker tests and can be done in minutes rather than days or weeks. This means oncologists can make treatment decisions faster, which also benefits the patient. It can also guide enrollment in clinical trials, again laying the foundation for better patient outcomes. And better patient outcomes are what it’s all about! The true power of AI is the impact it can have on cancer research and clinical practice to transform patient lives.

References

¹Campanella, G., Hanna, M.G., Geneslaw, L. et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat Med 25, 1301–1309 (2019)

²Raciti, P., Sue, J., et al. Clinical validation of artificial intelligence augmented pathology diagnosis demonstrates significant gains in diagnostic accuracy in prostate cancer detection. Arch Pathol Lab Med (in press)

