UVa_1592 - Database
Peter studies the theory of relational databases. A table in a relational database consists of values that are arranged in rows and columns.
There are different normal forms that a database may adhere to. Normal forms are designed to minimize the redundancy of data in the database. For example, a database table for a library might have a row for each book and columns for book name, book author, and author's email.
If the same author wrote several books, then this representation is clearly redundant. To formally define this kind of redundancy Peter has introduced his own normal form. A table is in Peter's Normal Form (PNF) if and only if there is no pair of rows and a pair of columns such that the values in the corresponding columns are the same for both rows.
How to compete in ACM ICPC | Peter | peter@neerc.ifmo.ru |
How to win ACM ICPC | Michael | michael@neerc.ifmo.ru |
Notes from ACM ICPC champion | Michael | michael@neerc.ifmo.ru |
The above table is clearly not in PNF, since the values in the 2nd and 3rd columns repeat in the 2nd and 3rd rows. However, if we introduce a unique author identifier and split this table into two tables -- one containing book name and author id, and the other containing book id, author name, and author email, then both resulting tables will be in PNF.
Given a table your task is to figure out whether it is in PNF or not.
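The definition above translates directly into a brute-force check over every pair of rows and every pair of columns. The sketch below is only meant to make the definition concrete: the function name `findViolationBruteForce` and its return convention are assumptions, not part of the problem, and it requires C++17. At the stated limits it would perform on the order of 10^9 string comparisons, so it is probably too slow as an actual submission.

```cpp
#include <array>
#include <cassert>
#include <optional>
#include <string>
#include <vector>

// Hypothetical helper: returns {r1, r2, c1, c2} (all 1-based) for the first
// violation found, or std::nullopt if the table is in PNF.
std::optional<std::array<int, 4>> findViolationBruteForce(
        const std::vector<std::vector<std::string>>& table) {
    const int n = static_cast<int>(table.size());
    const int m = n ? static_cast<int>(table[0].size()) : 0;
    // Assumption: the table is rectangular (every row has m values).
    for (const auto& row : table) assert(static_cast<int>(row.size()) == m);

    for (int r1 = 0; r1 < n; ++r1)
        for (int r2 = r1 + 1; r2 < n; ++r2)
            for (int c1 = 0; c1 < m; ++c1)
                for (int c2 = c1 + 1; c2 < m; ++c2)
                    if (table[r1][c1] == table[r2][c1] &&
                        table[r1][c2] == table[r2][c2])
                        return std::array<int, 4>{r1 + 1, r2 + 1, c1 + 1, c2 + 1};
    return std::nullopt;
}
```

Calling it on the three-row library table returns rows 2 and 3 with columns 2 and 3, which matches the explanation in the text.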
Input
Input contains several datasets. The first line of each dataset contains two integer numbers n and m (1 ≤ n ≤ 10000, 1 ≤ m ≤ 10), the number of rows and columns in the table. The following n lines contain table rows. Each row has m column values separated by commas. Column values consist of ASCII characters from space (ASCII code 32) to tilde (ASCII code 126) with the exception of comma (ASCII code 44). Values are not empty and have no leading and trailing spaces. Each row has at most 80 characters (including separating commas).
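Because values may contain spaces but never commas, each row has to be read as a whole line and then split on commas; reading with `cin >> word` would break values apart at spaces. A minimal sketch of that step follows; the helper name `splitRow` is an assumption made for illustration.

```cpp
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper: split one table row into its column values.
// Relies on the guarantee that values are non-empty and contain no commas.
std::vector<std::string> splitRow(const std::string& line) {
    std::vector<std::string> fields;
    std::stringstream in(line);
    std::string field;
    while (std::getline(in, field, ','))  // ',' is the delimiter, spaces are kept
        fields.push_back(field);
    return fields;
}
```

For the row `How to win ACM ICPC,Michael,michael@neerc.ifmo.ru` this yields three values, the first of which keeps its internal spaces.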
Output
For each dataset, if the table is in PNF write to the output file a single word "YES" (without quotes). If the table is not in PNF, then write three lines. On the first line write a single word "NO" (without quotes). On the second line write two integer row numbers r1 and r2 (1 ≤ r1, r2 ≤ n, r1 ≠ r2), on the third line write two integer column numbers c1 and c2 (1 ≤ c1, c2 ≤ m, c1 ≠ c2), so that values in columns c1 and c2 are the same in rows r1 and r2.
Sample Input
3 3
How to compete in ACM ICPC,Peter,peter@neerc.ifmo.ru
How to win ACM ICPC,Michael,michael@neerc.ifmo.ru
Notes from ACM ICPC champion,Michael,michael@neerc.ifmo.ru
2 3
1,Peter,peter@neerc.ifmo.ru
2,Michael,michael@neerc.ifmo.ru
Sample Output
NO
2 3
2 3
YES
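Problem summary:
Given a database table, determine whether there exist rows r1 ≠ r2 and columns c1 ≠ c2 such that (r1,c1)=(r2,c1) && (r1,c2)=(r2,c2), i.e. two different rows that hold identical strings in the same two columns.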
#include <bits/stdc++.h>
using namespace std;

int n, m;
string s, st;
map<string, int> idmp;

// Return the id of a string: reuse it if the string was seen before,
// otherwise assign the next free id.
int getId(const string& s) {
    auto it = idmp.find(s);
    if (it == idmp.end()) it = idmp.emplace(s, (int)idmp.size()).first; // not seen yet
    return it->second;
}

int main() {
    while (cin >> n >> m) {
        cin.ignore(numeric_limits<streamsize>::max(), '\n'); // skip the rest of the "n m" line
        vector<vector<int>> table(n);
        idmp.clear(); // reset the ids for every dataset!
        for (int i = 0; i < n; i++) {
            getline(cin, s);
            stringstream input(s);
            while (getline(input, st, ',')) table[i].push_back(getId(st));
        }
        bool isPNF = true; // whether the table is still considered to be in PNF
        for (int i = 0; i < m - 1 && isPNF; i++) { // every pair of columns (i, j)
            for (int j = i + 1; j < m && isPNF; j++) {
                map<pair<int, int>, int> pos; // (id in column i, id in column j) -> first row seen
                for (int k = 0; k < n && isPNF; k++) { // every row
                    pair<int, int> key(table[k][i], table[k][j]);
                    auto it = pos.find(key);
                    if (it == pos.end()) {
                        pos[key] = k;
                    } else {
                        printf("NO\n%d %d\n%d %d\n", it->second + 1, k + 1, i + 1, j + 1);
                        isPNF = false;
                    }
                }
            }
        }
        if (isPNF) printf("YES\n");
    }
    return 0;
}
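The solution above keys an ordered `map` on a pair of ids. One possible variant, shown here as a sketch and not as the author's method, packs the two ids into a single 64-bit key and uses `unordered_map` for both the string ids and the per-column-pair lookup. The function name `findViolation`, its return convention, and the choice of containers are assumptions; it requires C++17, and whether it is actually faster on the judge has not been measured.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <optional>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical helper: returns {r1, r2, c1, c2} (all 1-based) for a violation,
// or std::nullopt if the table is in PNF.
std::optional<std::array<int, 4>> findViolation(
        const std::vector<std::vector<std::string>>& table) {
    const int n = static_cast<int>(table.size());
    const int m = n ? static_cast<int>(table[0].size()) : 0;
    // Assumption: the table is rectangular (every row has m values).
    for (const auto& row : table) assert(static_cast<int>(row.size()) == m);

    // Step 1: replace every distinct string with a small integer id.
    std::unordered_map<std::string, std::uint32_t> ids;
    std::vector<std::vector<std::uint32_t>> cell(n, std::vector<std::uint32_t>(m));
    for (int r = 0; r < n; ++r)
        for (int c = 0; c < m; ++c) {
            const auto nextId = static_cast<std::uint32_t>(ids.size());
            cell[r][c] = ids.try_emplace(table[r][c], nextId).first->second;
        }

    // Step 2: for each pair of columns, remember the first row in which each
    // (id, id) combination appeared; a repeat is a PNF violation.
    for (int c1 = 0; c1 < m; ++c1)
        for (int c2 = c1 + 1; c2 < m; ++c2) {
            std::unordered_map<std::uint64_t, int> firstRow;
            firstRow.reserve(n);
            for (int r = 0; r < n; ++r) {
                const std::uint64_t key =
                    (std::uint64_t{cell[r][c1]} << 32) | cell[r][c2];
                auto [it, inserted] = firstRow.try_emplace(key, r);
                if (!inserted)
                    return std::array<int, 4>{it->second + 1, r + 1, c1 + 1, c2 + 1};
            }
        }
    return std::nullopt;
}
```

Either way the work is one pass over the n rows for each of the at most 45 column pairs, instead of comparing every pair of rows directly.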