Step 1 : Normalization
2015-10-05 16:49:47 6 举报
Normalization是一种数据处理技术,主要用于将不同尺度或范围的数据转换为统一的标准。这个过程通常包括将数据缩放到一个特定的范围,例如0到1,或者通过减去平均值并除以标准差来消除数据的偏差。Normalization在许多领域都有应用,如机器学习、数据挖掘和统计分析等。它可以帮助提高算法的性能,因为大多数算法都假设输入数据是标准化的。此外,Normalization还可以帮助我们更好地理解数据,因为它可以将复杂的数据集转化为更容易理解和比较的形式。
作者其他创作
大纲/内容
Street name Normalization
If Street name begins with 0
Try with Short Form if long format is given or long format whether short form is given
If street name contains FM/Farm Market and number
Remove the 0 that begins and only street name should present in the field.
Prefix of FM and Suffix of FM number should be removed. FM-xxxx will be exact format.
If street name contains text Highway or Hwy and number
All the special characters and values inside closed bracket needs to be removed
Prefix of Hwy and Suffix of Hwy number should be removed and highway/Hwy should be replaced by their short form of state. TX-35 will be the correct format.
If street name contains any special characters or values inside braces
0 条评论
下一页