Estimating Korean residence registration numbers from public information on SNS

Daeseon Choi, Younho Lee, Yongsu Park, Seokhyun Kim

Research output: Contribution to journalArticle

Abstract

People expose their personal information on social network services (SNSs). This paper warns of the dangers of this practice by way of an example. We show that the residence registration numbers (RRNs) of many Koreans, which are very important and confidential personal information analogous to social security numbers in the United States, can be estimated solely from the information that they have made open to the public. In our study, we utilized machine learning algorithms to infer information that was then used to extract a part of the RRNs. Consequently, we were able to extract 45.5% of SNS users' RRNs using a machine learning algorithm and brute-force search that did not consume exorbitant amounts of resources.

Original languageEnglish
Pages (from-to)565-574
Number of pages10
JournalIEICE Transactions on Communications
VolumeE98B
Issue number4
DOIs
StatePublished - 2015 Apr 1

Fingerprint

Learning algorithms
Learning systems

Keywords

  • Data estimation
  • Korean residence registration number
  • Personal data security
  • SNS security
  • Security

Cite this

Choi, Daeseon ; Lee, Younho ; Park, Yongsu ; Kim, Seokhyun. / Estimating Korean residence registration numbers from public information on SNS. In: IEICE Transactions on Communications. 2015 ; Vol. E98B, No. 4. pp. 565-574.
@article{baae1bf3f3f446a9aac9b66345d60959,
title = "Estimating Korean residence registration numbers from public information on SNS",
abstract = "People expose their personal information on social network services (SNSs). This paper warns of the dangers of this practice by way of an example. We show that the residence registration numbers (RRNs) of many Koreans, which are very important and confidential personal information analogous to social security numbers in the United States, can be estimated solely from the information that they have made open to the public. In our study, we utilized machine learning algorithms to infer information that was then used to extract a part of the RRNs. Consequently, we were able to extract 45.5{\%} of SNS users' RRNs using a machine learning algorithm and brute-force search that did not consume exorbitant amounts of resources.",
keywords = "Data estimation, Korean residence registration number, Personal data security, SNS security, Security",
author = "Daeseon Choi and Younho Lee and Yongsu Park and Seokhyun Kim",
year = "2015",
month = "4",
day = "1",
doi = "10.1587/transcom.E98.B.565",
language = "English",
volume = "E98B",
pages = "565--574",
journal = "IEICE Transactions on Communications",
issn = "0916-8516",
number = "4",

}

Estimating Korean residence registration numbers from public information on SNS. / Choi, Daeseon; Lee, Younho; Park, Yongsu; Kim, Seokhyun.

In: IEICE Transactions on Communications, Vol. E98B, No. 4, 01.04.2015, p. 565-574.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Estimating Korean residence registration numbers from public information on SNS

AU - Choi, Daeseon

AU - Lee, Younho

AU - Park, Yongsu

AU - Kim, Seokhyun

PY - 2015/4/1

Y1 - 2015/4/1

N2 - People expose their personal information on social network services (SNSs). This paper warns of the dangers of this practice by way of an example. We show that the residence registration numbers (RRNs) of many Koreans, which are very important and confidential personal information analogous to social security numbers in the United States, can be estimated solely from the information that they have made open to the public. In our study, we utilized machine learning algorithms to infer information that was then used to extract a part of the RRNs. Consequently, we were able to extract 45.5% of SNS users' RRNs using a machine learning algorithm and brute-force search that did not consume exorbitant amounts of resources.

AB - People expose their personal information on social network services (SNSs). This paper warns of the dangers of this practice by way of an example. We show that the residence registration numbers (RRNs) of many Koreans, which are very important and confidential personal information analogous to social security numbers in the United States, can be estimated solely from the information that they have made open to the public. In our study, we utilized machine learning algorithms to infer information that was then used to extract a part of the RRNs. Consequently, we were able to extract 45.5% of SNS users' RRNs using a machine learning algorithm and brute-force search that did not consume exorbitant amounts of resources.

KW - Data estimation

KW - Korean residence registration number

KW - Personal data security

KW - SNS security

KW - Security

UR - http://www.scopus.com/inward/record.url?scp=84926474373&partnerID=8YFLogxK

U2 - 10.1587/transcom.E98.B.565

DO - 10.1587/transcom.E98.B.565

M3 - Article

AN - SCOPUS:84926474373

VL - E98B

SP - 565

EP - 574

JO - IEICE Transactions on Communications

JF - IEICE Transactions on Communications

SN - 0916-8516

IS - 4

ER -