Elena Robu | Astun Technology
Photo by YesMore Content on Unsplash
In simple terms, metadata is data about data.
Quality
- is just a hyphen
Findable
Accessible
Interoperable
Reusable
Why are we doing this? 🤷
How do we do this now? 🕰️
Metadata Crawler open source plugin for Talend Open Studio (aka Crawler)
"industrial strength open source NLP in Python"
NER model trained on OS Open Names
Pattern recognition model (postcodes)
Text Classification
Photo by Hans-Jurgen Mager on Unsplash