您的位置：首页 > 数据库

hive中使用sql语句需要注意的事项

2014-04-14 21:39 609 查看

最近在熟悉hive，使用hive中的sql语句过程中出现了一些问题。

1，hive中的insert into语句

hive> select * from t_hive2;
OK
16	2	3
61	12	13
41	2	31
17	21	3
71	2	31
1	12	34
11	2	34
Time taken: 0.218 seconds
hive> insert into t_hive2 values(122,34,2);
FAILED: Parse Error: line 1:12 mismatched input 't_hive2' expecting TABLE near 'into' in insert clause

从上面可以看出，hive是不支持insert into 语句的

2，不支持Date和DateTime类型

hive> alter table t_hive2 add columns (time_show Date);
FAILED: Error in semantic analysis: DATE and DATETIME types aren't supported yet. Please use TIMESTAMP instead

3,hive的表分为外部表和内部表

Hive 创建内部表时，会将数据移动到数据仓库指向的路径；若创建外部表，仅记录数据所在的路径，不对数据的位置做任何改变。

在删除表的时候，内部表的元数据和数据会被一起删除，而外部表只删除元数据，不删除数据。这样外部表相对来说更加安全些，数据组织也更加灵活，方便共享源数据。

4，hive中两个表链接时和以前有些不同

hive> select *     
    > from t_hive,t_hive2
    > where t_hive.a=t_hive2.b;
FAILED: Parse Error: line 2:11 mismatched input ',' expecting EOF near 't_hive'

5，内置函数查看命令（show functions;desc function 函数名）

hive> desc function when;
OK
There is no documentation for function 'when'
Time taken: 0.095 seconds
hive> desc function rand;
OK
rand([seed]) - Returns a pseudorandom number between 0 and 1
Time taken: 0.078 seconds

6，hive中不支持truncate table 表名的形式存在(也包括 delete from 表名)，可通过 hive>dfs -rmr /user/hive/warehouse/表名来清空该表下的数据，以便保持表元数据信息不丢失；或者通过create table 表名 like 表名，也可以。

hive> truncate table t_hive;
FAILED: Parse Error: line 1:0 cannot recognize input near 'truncate' 'table' 't_hive'

hive> delete from table t_hive;
Usage: delete [FILE|JAR|ARCHIVE] <value> [<value>]*

7，hive中不支持 in (子查询语句)，比如： in (select id from 表名) .可以通过内连接或者半连接 from 表1 left semi join 表2 on （表1.列名 = 表2.列名），表2只能在on中出现，不能在select中引用

8，hive中在不需要全局排序的情况下，写排序语句时，最好用distribute by 表名.字段名 sort by 表名.字段名 asc | desc 的形式，尽量不用order by形式（只通过一个reduce来完成所有的排序结果）

内容来自用户分享和网络整理，不保证内容的准确性，如有侵权内容，可联系管理员处理

标签：

相关文章推荐

新的分享

章节导航