Your data is not highly sensitive (highly sensitive data includes, for example, home locations in very sparsely populated areas or records linked to medical data)
You want to publish your data in an online repository
You want to share your data with others (e.g. with a journal editor or reviewers, alongside your article, or with students in a course you're teaching)
Step 1: Choose the right method
The two methods differ in complexity and therefore in processing time. The donut approach displaces each point by random noise bounded by a minimum and a maximum threshold, while the advanced method optimizes the noise based on the context and k-anonymity.
Go with the donut approach if:
Your data is not very sensitive (e.g. public places)
You want a fast process (almost instant – may vary based on data size and website traffic)
Go with the advanced anonymizer if:
Locations are sensitive (e.g. home locations, workplace)
You are not in a hurry to get the results (can take up to several hours depending on data size and website traffic)
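To make the donut approach more concrete, here is a minimal sketch of displacing a point by random noise between a minimum and a maximum threshold. It is an illustration only, not the tool's actual implementation: the function name donut_displace, the sample coordinates, and the thresholds are hypothetical, the coordinates are assumed to be in a projected system measured in metres, and sampling the radius uniformly over the ring's area is one common choice that the tool may or may not use.

```python
import math
import random


def donut_displace(x, y, r_min, r_max, rng=None):
    """Move a point a random distance in [r_min, r_max] in a random direction.

    Hypothetical helper; assumes planar coordinates in metres.
    """
    if not 0 <= r_min <= r_max:
        raise ValueError("expected 0 <= r_min <= r_max")
    if rng is None:
        rng = random.Random()
    # Random direction around the original point.
    angle = rng.uniform(0.0, 2.0 * math.pi)
    # Sample the radius so that displaced points are spread evenly
    # over the area of the ring (an assumption, not the tool's rule).
    u = rng.random()
    radius = math.sqrt(u * (r_max ** 2 - r_min ** 2) + r_min ** 2)
    return x + radius * math.cos(angle), y + radius * math.sin(angle)


# Made-up example points and thresholds (50 m to 200 m).
rng = random.Random(42)
points = [(385000.0, 6672000.0), (385250.0, 6672100.0)]
masked = [donut_displace(x, y, 50.0, 200.0, rng) for x, y in points]
```

The minimum threshold keeps a masked point from landing on or very near its true location, and the maximum threshold limits how far the data is distorted.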
Step 2: Prepare your data
Remove all personal information and identifiers from your data. Because you are uploading your data to the internet, never include sensitive information that could be used to identify individuals.
Make sure the number of rows in your dataset does not exceed the current limit (10,000 for the donut method, 1,500 for the advanced method). If your dataset is larger, you can split it into separate files to divide the task.
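The preparation steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper prepare_chunks and the field names respondent_id and email are hypothetical, records are assumed to be plain dictionaries, and the row limits are the ones quoted above, which may change. Writing each chunk to a file is left out because the expected upload format is not described here.

```python
# Row limits as quoted in this guide; check the tool for current values.
ROW_LIMITS = {"donut": 10_000, "advanced": 1_500}


def prepare_chunks(rows, identifier_fields, method):
    """Drop identifier fields, then split the rows into chunks under the limit."""
    limit = ROW_LIMITS[method]
    # Keep only the fields that are not listed as identifiers.
    cleaned = [
        {key: value for key, value in row.items() if key not in identifier_fields}
        for row in rows
    ]
    # Slice the cleaned rows into consecutive chunks of at most `limit` rows.
    return [cleaned[i:i + limit] for i in range(0, len(cleaned), limit)]


# Made-up example: 3,200 records with two identifier fields.
rows = [
    {"respondent_id": i, "email": f"user{i}@example.org",
     "x": 385000.0 + i, "y": 6672000.0}
    for i in range(3200)
]
chunks = prepare_chunks(rows, {"respondent_id", "email"}, "advanced")
```

With the advanced method's limit, the 3,200 example records end up in three chunks, each of which could then be saved and submitted as a separate file.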
Step 3: Get an analysis token
Get yourself an analysis token using a valid email address (advanced anonymization results will be emailed to this address).
Here you can also retrieve your previously generated valid tokens, or delete them.
Tokens will be automatically deleted after 24 hours. You can delete them manually if you want your email address to be erased from the server before the token expires.
Step 4: Go to the tool
Go to your desired tool and provide the needed information (the recently generated token should be automatically loaded – if not, use the good old “copy and paste” method)
Run the analysis. With the donut method, you will be redirected to the results page, where you can download the anonymized data as a zipped shapefile. With the advanced anonymization, your request will be queued for processing and the results will be emailed to you as soon as they are available (the link in the email will expire after 30 minutes).
GeoHub will not keep your data or your email address after the process has completed. Your email address is deleted from the server as soon as your process is ready and the email is sent. Your original and anonymized data will be erased from our server within one hour of the task's completion.
To refer to the methodology used for anonymization, please cite:
Hasanzadeh, K., Kajosaari, A., Häggman, D., & Kyttä, M. (2020). A context sensitive approach to anonymizing public participation GIS data: From development to the assessment of anonymization effects on data quality. Computers, Environment and Urban Systems, 83, 101513.