Cleaning Data in SQL Server Databases
Miriam Antona
Software Engineer
SELECT * FROM series
| id | name | contact_number | ... |
|-----|-----------------|----------------|-----|
| 1 | Adventure Time | 555-906-8845 | ... |
| 2 | Dexter | 555-156-8845 | ... |
| 3 | Futurama | 555-210-9951 | ... |
| 4 | Game of Thrones | 555-543-6641 | ... |
| ... | ... | ... | ... |
LIKE
.Wildcard character | Description | Example |
---|---|---|
% | Any string of zero or more characters | WHERE contact_number LIKE '555-%' |
- (underscore) | Any single character | WHERE contact_number LIKE '___-___-____' |
[] | Any single character within the specified range or set | WHERE contact_number LIKE '[2-9][0-9][0-9]-[2-9][0-9][0-9]-[0-9][0-9][0-9][0-9] |
[^] | Any single character not within the specified range or set | WHERE contact_number LIKE '[^2-9]' |
SELECT name, contact_number
FROM series
WHERE contact_number LIKE '555%'
| name | contact_number |
|-----------------|----------------|
| Adventure Time | 555-906-8845 |
| Dexter | 555-156-8845 |
| Futurama | 555-210-9951 |
| Game of Thrones | 555-abc-6641 |
| ... | ... |
SELECT
name,
contact_number
FROM series
WHERE contact_number NOT LIKE '555%'
| name | contact_number |
|-----------------|----------------|
| The Good Doctor | 000-930-1274 |
SELECT
name,
contact_number
FROM series
WHERE contact_number LIKE '[2-9][0-9][0-9]-[2-9][0-9][0-9]-[0-9][0-9][0-9][0-9]'
| name | contact_number |
|----------------|----------------|
| Adventure Time | 555-906-8845 |
| Futurama | 555-210-9951 |
| Homeland | 555-985-6314 |
| Westworld | 555-456-1234 |
| ... | ... |
SELECT
name,
contact_number
FROM series
WHERE contact_number NOT LIKE '[2-9][0-9][0-9]-[2-9][0-9][0-9]-[0-9][0-9][0-9][0-9]'
| name | contact_number |
|-----------------|----------------|
| Dexter | 555-156-8845 |
| Game of Thrones | 555-abc-6641 |
| The Good Doctor | 000-930-1274 |
Cleaning Data in SQL Server Databases