Is anyone interested in working on a team to create an Entity Resolution / Record Linkage plugin for use inside Dataiku DSS? The vision would be to make it open source. I'm particularly interested in this because I'm working with the Cascade Bicycle Club, which could use a plugin like this to better manage its constituents. I know of several other non-profits that could also benefit from such a tool.
If you are interested in chipping in on this open-source project, please reach out.
User Story:
As a Non-Profit Analyst who deals with messy CRM data containing a significant population of duplicates (record clusters) spread across multiple incomplete records, I would like a Record Linkage Dedupe Plugin for DSS that makes this process accessible to a broader set of analysts. There are a number of Python packages that do this kind of work. When the process is easier and more complete, and we can find more of the records that belong to the same cluster, we will get more accurate analyses and models.
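To make the idea of "record clusters" concrete, here is a rough sketch of what the core of such a plugin might do. It uses only the Python standard library rather than any of the dedicated record linkage packages. The field names (`name`, `email`), the matching rule, the 0.85 threshold, and the sample records are all assumptions for illustration, not a proposed design.

```python
from difflib import SequenceMatcher
from itertools import combinations


def normalize(value):
    """Lowercase and collapse whitespace so trivial differences do not block a match."""
    return " ".join(str(value or "").lower().split())


def is_match(rec_a, rec_b, name_threshold=0.85):
    """Hypothetical rule: identical non-empty emails match; otherwise compare names."""
    email_a, email_b = normalize(rec_a.get("email")), normalize(rec_b.get("email"))
    if email_a and email_a == email_b:
        return True
    name_a, name_b = normalize(rec_a.get("name")), normalize(rec_b.get("name"))
    if not name_a or not name_b:
        # Two incomplete records with no name are not evidence of a duplicate.
        return False
    return SequenceMatcher(None, name_a, name_b).ratio() >= name_threshold


def cluster_records(records, name_threshold=0.85):
    """Group record indices into clusters using union-find over pairwise matches."""
    parent = list(range(len(records)))

    def find(i):
        # Follow parent links to the cluster root, shortening the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Compare every pair; fine for a sketch, too slow for a large CRM export.
    for i, j in combinations(range(len(records)), 2):
        if is_match(records[i], records[j], name_threshold):
            parent[find(j)] = find(i)

    clusters = {}
    for i in range(len(records)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Made-up sample records, not real constituent data.
records = [
    {"name": "Jane Smith", "email": "jane@example.org"},
    {"name": "Jane  Smyth", "email": ""},
    {"name": "J. Smith", "email": "JANE@example.org"},
    {"name": "Robert Jones", "email": "rjones@example.org"},
]
clusters = cluster_records(records)
print(clusters)  # [[0, 1, 2], [3]]
```

Note that records 1 and 2 do not match each other directly; they end up in the same cluster only because both match record 0. That transitive grouping is what lets several incomplete records be recognized as one constituent. A real plugin would presumably add blocking to avoid the all-pairs comparison and lean on one of the existing Python packages for the matching itself.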
COS
Nice to Have:
Notes