Home location detection algorithm comparison using mobile phone data vs real users ground truth

dc.contributor.advisorFerres, Leo
dc.contributor.authorSacasa Ares, Manuel Antonio
dc.date.accessioned2021-08-30T17:48:54Z
dc.date.available2021-08-30T17:48:54Z
dc.date.issued2020-08
dc.descriptionTesis presentada a la Facultad de Ingeniería de la Universidad del Desarrollo para optar al título de Magíster en Ciencia de Datoses
dc.description.abstractThe detection of Home location is one of the main profiling attributes for telcos, real estate, banking, advertising and targeting companies. Literature shows many examples of criteria, heuristics and algorithms trying to solve the problem. In this document, the objective is to select a final criteria and data source comparing: 5 algorithm, a 2-week data set of three mobile data sources (CDRs-XDRs-Control Plane) and a 75-inhabitants real address data set, built with a survey method, as a ground truth test data set. This lead us to understand: - The behavior of the data source: users hit frequency and the impact on algorithm results, method of processing home antenna ranking one, human dependency day-night source behavior and home antenna precision, hit recording logic, noise, error and their limits to solve home location. - The possible solutions: type of criteria, Geo-time heuristics, counting algorithm types (one or two-step), performance and precision of each algorithm linked to each source. - The metrics: Test correlation between the closest tower in Euclidean distance versus home tower (output of each algorithm), bias result from 3G data vs. 4g(Lte) antennas, measured Euclidean distance as error between real declared address coordinates and home tower when it’s the closest tower or not. The first step of the document takes the recommendations and implementation of HDAs (Home detection algorithms) and criteria [1][3][5][6][14][16] to compare the difference and behavior between sources and criteria applied to each algorithm (Ranking antenna’s hits, ranking antenna’s frequency, ranking with time filter, ranking with geographical filter and mix geo-time filters). The second step is to design metrics in order to compare the pair of best performers algorithm with their source: binary metric for a match between HDA result and home antenna ranking, valued match between ranking 1-2-3 home antenna, absolute error distance and MSE distance to real address. The final step is to test an experimental new approach, applying a circadian sleep cycle for each user, to detect time range and process an individual time-range home antenna as a solution for the gaps detected in the first two steps. Comparing circadian results versus the best HDA method, source and groundtruth.es
dc.format.extent20 p.es
dc.identifier.urihttp://hdl.handle.net/11447/4531
dc.language.isoenes
dc.publisherUniversidad del Desarrollo. Facultad de Ingenieríaes
dc.subjectHomees
dc.subjectLocationes
dc.subjectMobilees
dc.subjectResidencees
dc.subjectCDRes
dc.subjectXDRes
dc.subjectSignalizationes
dc.subjectMobilityes
dc.subjectUrbanes
dc.subject070037Ses
dc.titleHome location detection algorithm comparison using mobile phone data vs real users ground truthes
dc.typeThesises

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Home location detection algorithm comparison using mobile phone data vs real users ground truth.pdf
Size:
3.58 MB
Format:
Adobe Portable Document Format
Description:
Texto completo
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: