Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/175633
Full metadata record
DC FieldValueLanguage
dc.contributor.authorLee, Cara Zheng Yanen_US
dc.date.accessioned2024-05-02T02:30:51Z-
dc.date.available2024-05-02T02:30:51Z-
dc.date.issued2024-
dc.identifier.citationLee, C. Z. Y. (2024). Detecting fraud via statistical anomalies. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175633en_US
dc.identifier.urihttps://hdl.handle.net/10356/175633-
dc.description.abstractUrban planners and researchers are increasingly integrating mobility data in designing smarter and sustainable cities. It is therefore crucial to identify any anomalies in the dataset to prevent poor planning or statistical interferences. Such mobility data could come from public sources or data brokers, like CityData who offers products for their customers’ economic development. Few studies had detected anomalies in the September 2020 dataset provided by CityData in the context of their research [1], [2] but there is a general lack of studies that focused on analysing those anomalies. Therefore, the purpose of this report is to: find more anomalies not present in previous studies, determine the manipulated ping percentage in each Singapore zone, and then determine if the data was intentionally manipulated. We did these by synthesising statistical techniques proposed by [3] and three other mathematical methods. We found three more anomalies: a circle and line segment, excessive pings, and squares. The number of decimal places (d.p) a ping could have was classified into 16 independent and uniformly distributed bins. We found that our statistical anomalies were the excessive ping anomalies whose d.p do not follow a uniform distribution. Our results indicated that Mandai and Southern Islands produced the highest manipulated percentages while River Valley produced the lowest manipulated percentage. Moreover, Central Area had the largest manipulated percentage SD across all regions. Thus, CityData might had intentionally manipulated the dataset to corroborate the interests of Singapore’s urban planners.en_US
dc.language.isoenen_US
dc.publisherNanyang Technological Universityen_US
dc.subjectMathematical Sciencesen_US
dc.titleDetecting fraud via statistical anomaliesen_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.supervisorFedor Duzhinen_US
dc.contributor.schoolSchool of Physical and Mathematical Sciencesen_US
dc.description.degreeBachelor's degreeen_US
dc.contributor.supervisoremailFDuzhin@ntu.edu.sgen_US
dc.subject.keywordsStatistical anomaliesen_US
dc.subject.keywordsAnomaliesen_US
dc.subject.keywordsFrauden_US
item.grantfulltextrestricted-
item.fulltextWith Fulltext-
Appears in Collections:SPMS Student Reports (FYP/IA/PA/PI)
Files in This Item:
File Description SizeFormat 
Final_FYP_draft.pdf
  Restricted Access
18.54 MBAdobe PDFView/Open

Page view(s)

48
Updated on Sep 15, 2024

Download(s)

5
Updated on Sep 15, 2024

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.