Donât tell Bob Rogersâ team something canât be done.
When Rogers embarked on an ambitious project to automate the processing of the more than 1.4 million electronically faxed documents received annually by the Center for Digital Health Innovation at the University of California, San Francisco (UCSF CDHI), advisors and vendors initially told him the project was âimpossible.â âWe had a panel of experts come to us as part of our due diligence who said âwhat youâre trying to do is impossibleâ,â Rogers, who serves as the centerâs expert in residence for artificial intelligence, said during a recent H2O.ai webinar . âWe were hopeful that information extraction from structured documents was possible, but we werenât sure.â
The Center for Digital Health Innovation, which provides the renowned UCSF health system the latest advanced technologies to support the centerâs innovative patient care, was challenged in efficiently and accurately handling this wide array of patient records including referrals, prescription requests, requisitions for medical equipment, lab results and other forms. Rogersâ team had a vision for not only easing the administrative burden on the centerâs staff but ultimately improving patient care and outcomes.
Medical records are among the most complex documents to process. Each medical facility, care provider and insurer has unique forms that donât necessarily follow a standard pattern. While these semi-structured documents include similar information, they do so in different formats and inconsistently labeled fields. âThese are not simple forms or forms that repeat,â Rogers said. âThere is nuance in primary care, current provider, referrals and current history that can be up to 100 pages. Figuring out who is who and what the intent is is a complex undertaking,â he added.
The center had experimented with optical character recognition (OCR) and robotic process automation (RPA) to extract information to limited success. According to Lu Chen, lead data scientist at UCSF CDHI who led what came to be known as the Intake Automation project: âWe tried to solve the problem with a template-based fax process, with predefined areas where OCR could look and extract information. (However), templates changed over time and the success rate dropped year over year.â
UCSF CDHI turned to the team at H2O.ai to explore how H2O Document AIÂ could overcome the initial limitations of Intake Automation. âUCSF came in with the RPA efforts they had tried, and they were data-rich when they came to us,â said Mark Landry, one of H2O.aiâs lead data scientists and a Kaggle Grandmaster who consulted on the project. They âwere already able to get more accurate characters from the screen and could deploy a more complex learning algorithmâ that could recognize, for example, what a patient name looks like regardless of the format or who sent the form.
H2O Document AI augmented the Intake Automation solution with the addition of intelligent character recognition (ICR) that utilizes dynamically learning algorithms for general character and word recognition, understanding of the documentâs layout and natural language processing (NLP)Â to make document management easier. H2O Document AI comprises six logical processes:
Rogers and Chen credit the tight cooperation between UCSF CDHI and the H2O.ai team for the success of Intake Automation. âThe collaboration process with H2O.ai was key in how we were able to succeed,â Rogers said. âUCSF and H2O were aligned in the mission, which makes for a fantastic partnership. (We) were able to come together with the right data and right conception of the problem and how it fits within our applications, and, frankly, I donât think Iâm overstepping to say that Lu and I were a bit starstruck working with Mark and some of the other Kaggle Grandmasters together on this project.â
Sri Ambati, CEO and co-founder of H2O.ai, said: âThe best innovation in AI is no longer at the level of the algorithm, but is actually at the co-creation of data scientists and domain experts. If you can make that (collaboration) happen easily and take the best of the context-setting and the best of the algorithms, and automate that to fine-tune and search, you have a very compelling outcome.â
To learn more about H2O Document AI and how it can address your document automation challenges, please visit our website .