for line in tcp.collect():
    hive_context.sql("SELECT 'zip' as Variable_name, percentile(zip, 0.25) as Q1, percentile(zip, 0.75) as Q3 FROM df_tab").show()  # 'zip' should be replaced by the loop variable line
I tried something like this as well, but it didn't work:
query = "SELECT {d_line} as Variable_name, percentile({line}, 0.25) as Q1, percentile({line}, 0.75) as Q3 FROM df_tab".format(d_line=line, line=line)  # this gives me the output:
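SELECT zip as Variable_name, percentile(zip, 0.25) as Q1, percentile(zip, 0.75) as Q3 FROM df_tab (here the first zip has to appear in single quotes)
Expected output query: SELECT 'zip' as Variable_name, percentile(zip, 0.25) as Q1, percentile(zip, 0.75) as Q3 FROM df_tab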
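
One way to get the quoted label might be to put the single quotes in the template itself, around the placeholder, so that only the label is quoted and the column reference stays bare. This is a minimal sketch, assuming `line` is a plain string holding the column name; the helper name `build_quartile_query` and its `table` parameter are hypothetical and not part of the original code.

```python
def build_quartile_query(column, table="df_tab"):
    """Return a Hive SQL string that reports Q1 and Q3 for one column."""
    # '{name}' wraps the label in literal single quotes;
    # {col} is left unquoted so Hive treats it as a column reference.
    template = (
        "SELECT '{name}' as Variable_name, "
        "percentile({col}, 0.25) as Q1, "
        "percentile({col}, 0.75) as Q3 "
        "FROM {table}"
    )
    return template.format(name=column, col=column, table=table)


query = build_quartile_query("zip")
print(query)
```

Inside the loop, the resulting string could then be passed to hive_context.sql(query).show(). Because the column name is interpolated directly into the SQL text, this only suits column names that come from a trusted source such as the table's own schema.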