How do I reshape and plot a DataFrame in Julia?

Question   Votes: 2   Answers: 1

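I'm looking for a way to reshape a DataFrame from wide to long and then plot the result. (This should be a simple operation, but I'm new to Julia and by no means an expert.)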

Specifically, I have a data frame with the following structure:

julia> df = DataFrame(Country = ["Italy","France","Germany"], Date1 = [1,4,6], Date2 = [2,5,9], Date3 = [4,3,12])
3×4 DataFrame
│ Row │ Country │ Date1 │ Date2 │ Date3 │
│     │ String  │ Int64 │ Int64 │ Int64 │
├─────┼─────────┼───────┼───────┼───────┤
│ 1   │ Italy   │ 1     │ 2     │ 4     │
│ 2   │ France  │ 4     │ 5     │ 3     │
│ 3   │ Germany │ 6     │ 9     │ 12    │

I have successfully reshaped the data with the stack() function, as follows:

julia> df_long = stack(df,2:4)
9×3 DataFrame
│ Row │ variable │ value │ Country │
│     │ Symbol   │ Int64 │ String  │
├─────┼──────────┼───────┼─────────┤
│ 1   │ Date1    │ 1     │ Italy   │
│ 2   │ Date1    │ 4     │ France  │
│ 3   │ Date1    │ 6     │ Germany │
│ 4   │ Date2    │ 2     │ Italy   │
│ 5   │ Date2    │ 5     │ France  │
│ 6   │ Date2    │ 9     │ Germany │
│ 7   │ Date3    │ 4     │ Italy   │
│ 8   │ Date3    │ 3     │ France  │
│ 9   │ Date3    │ 12    │ Germany │

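Now I would like to create a plot with the variable column on the x-axis and the value column on the y-axis. However, the variable column has type Symbol (not String, as I had hoped), so it cannot be plotted. This is the code I used to create the plot: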

julia> Plots.plot(df_long.variable,df_long.value)
ERROR: Cannot convert Symbol to series data for plotting
Stacktrace:
 [1] prepareSeriesData(::Symbol) at /Users/kayvon/.julia/packages/Plots/12uaJ/src/series.jl:14
 [2] convertToAnyVector(::Symbol, ::Dict{Symbol,Any}) at /Users/kayvon/.julia/packages/Plots/12uaJ/src/series.jl:27
 [3] (::Plots.var"#152#155"{Dict{Symbol,Any}})(::Symbol) at ./none:0
 [4] iterate(::Base.Generator{Array{Symbol,1},Plots.var"#152#155"{Dict{Symbol,Any}}}) at ./generator.jl:47
 [5] convertToAnyVector(::Array{Symbol,1}, ::Dict{Symbol,Any}) at /Users/kayvon/.julia/packages/Plots/12uaJ/src/series.jl:42
 [6] macro expansion at /Users/kayvon/.julia/packages/Plots/12uaJ/src/series.jl:130 [inlined]
 [7] apply_recipe(::Dict{Symbol,Any}, ::Type{Plots.SliceIt}, ::Array{Symbol,1}, ::Array{Int64,1}, ::Nothing) at /Users/kayvon/.julia/packages/RecipesBase/G4s6f/src/RecipesBase.jl:279
 [8] _process_userrecipes(::Plots.Plot{Plots.GRBackend}, ::Dict{Symbol,Any}, ::Tuple{Array{Symbol,1},Array{Int64,1}}) at /Users/kayvon/.julia/packages/Plots/12uaJ/src/pipeline.jl:85
 [9] _plot!(::Plots.Plot{Plots.GRBackend}, ::Dict{Symbol,Any}, ::Tuple{Array{Symbol,1},Array{Int64,1}}) at /Users/kayvon/.julia/packages/Plots/12uaJ/src/plot.jl:178
 [10] #plot#138(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(plot), ::Array{Symbol,1}, ::Vararg{Any,N} where N) at /Users/kayvon/.julia/packages/Plots/12uaJ/src/plot.jl:57
 [11] plot(::Array{Symbol,1}, ::Array{Int64,1}) at /Users/kayvon/.julia/packages/Plots/12uaJ/src/plot.jl:51
 [12] top-level scope at none:0

Is there a way to use stack() so that the variable column ends up with type String? Or am I going about this the wrong way? Is there a simpler approach?

Thanks, I appreciate any help you can offer!

dataframe plot julia reshape
1 Answer
Votes: 0
plot(String.(df_long.variable),df_long.value)

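Note the dot (.) after String: this is Julia's broadcasting (dot) syntax. It applies String to every element, converting the whole vector of Symbols into a vector of Strings.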
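To see the dot syntax on its own, here is a minimal sketch that needs no packages. The vector `symbols` is a made-up stand-in for `df_long.variable`, and the `map` and comprehension forms are shown only as equivalent spellings of the same conversion.

```julia
# Hypothetical stand-in for df_long.variable (a Vector{Symbol})
symbols = [:Date1, :Date2, :Date3]

# The dot broadcasts String over the vector, one element at a time
labels = String.(symbols)

# Equivalent ways to write the same conversion
labels_map  = map(String, symbols)
labels_comp = [String(s) for s in symbols]

println(labels)
```

All three results are plain vectors of strings, which Plots can use as categorical x-axis values.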

For this data, though, you will probably prefer a scatter plot.

scatter(String.(df_long.variable),df_long.value)
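As for getting a String column straight out of stack(): the sketch below assumes a DataFrames.jl release in which stack accepts a `variable_eltype` keyword. As far as I know, recent 1.x releases have it and already default to String, so the keyword may be redundant there. Treat this as an assumption about your installed version.

```julia
using DataFrames

df = DataFrame(Country = ["Italy", "France", "Germany"],
               Date1 = [1, 4, 6], Date2 = [2, 5, 9], Date3 = [4, 3, 12])

# Ask stack for String labels in the variable column.
# Assumption: your DataFrames.jl version supports `variable_eltype`.
df_long = stack(df, 2:4; variable_eltype = String)

# The element type of the label column should now be String, not Symbol
println(eltype(df_long.variable))
```

If that holds on your version, `df_long.variable` can be passed to plot or scatter without the `String.(...)` conversion.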