One of the obstacles and concerns organizations have before publishing open data is unintentional disclosure of private information about individuals or other sensitive information.
Example for unintentional disclosure during the publication of open data can be publication of peoples names, addresses, bank account and credit card numbers, etc.
One of the most widely used tool for wokring with data is MS Excel. It will be great to have native plug-in that can run such checks/filtering directly from Excel application.
Diversity of data formats makes it important flexibility in way data is loaded into the tool. Support of widely used formats such as XML, JSON, CSV/Excel, ESRI Shapefiles, RDBMS/SQL, etc will make this tool easier to use.
Solution should be a simple tool/application that can take open data in any machine readable format and scan it for presence of private/sensitive data.
Preferably tool should be highly configurable to tune filtering/search for different problems (e.g. people related, bank accounts/credit cards related, special catchwords, etc)