datastax—使用dsbulk将json数据加载到cassandra中

kkbh8khc 于 2021-06-13 发布在 Cassandra

关注(0)|答案(0)|浏览(284)

我觉得将json文件加载到cassandra的文档在dsbulk文档中是非常缺乏的。
下面是我试图加载的json文件的一部分：

[
  {
    "tags": [
      "r"
    ],
    "owner": {
      "reputation": 23,
      "user_id": 12235281,
      "user_type": "registered",
      "profile_image": "https://www.gravatar.com/avatar/60e28f52215bff12adb9758fc2cf86dd?s=128&d=identicon&r=PG&f=1",
      "display_name": "Me28",
      "link": "https://stackoverflow.com/users/12235281/me28"
    },
    "is_answered": false,
    "view_count": 3,
    "answer_count": 0,
    "score": 0,
    "last_activity_date": 1589053659,
    "creation_date": 1589053659,
    "question_id": 61702762,
    "link": "https://stackoverflow.com/questions/61702762/merge-dataframes-in-r-with-different-size-and-condition",
    "title": "Merge dataframes in R with different size and condition"
  },
  {
    "tags": [
      "python",
      "location",
      "pyautogui"
    ],
    "owner": {
      "reputation": 1,
      "user_id": 13507535,
      "user_type": "registered",
      "profile_image": "https://lh3.googleusercontent.com/a-/AOh14GgtdM9KrbH3X5Z33RCtz6xm_TJUSQS_S31deNYUcA=k-s128",
      "display_name": "lowhatex",
      "link": "https://stackoverflow.com/users/13507535/lowhatex"
    },
    "is_answered": false,
    "view_count": 2,
    "answer_count": 0,
    "score": 0,
    "last_activity_date": 1589053657,
    "creation_date": 1589053657,
    "question_id": 61702761,
    "link": "https://stackoverflow.com/questions/61702761/want-to-get-a-grip-of-this-pyautogui-command",
    "title": "Want to get a grip of this pyautogui command"
  }
]

我尝试加载的方式如下： dsbulk load -url ./data_so1.json -k stackoverflow_t -t staging_t -h '182.14.0.1' -header false -u username -p password 这是我得到的最接近的值，它将值逐行推入cassandra，如下所示：

data
-------------------------------------------------------------------------------------------------------------------------------
      "title": "&#39;Microsoft.ACE.OLEDB.12.0&#39; provider is not registered on the local machine giving exception on client"
               "profile_image": "https://www.gravatar.com/avatar/05085ede54486bdaebefcf8363e081e2?s=128&d=identicon&r=PG&f=1",
                                                                                                            "view_count": 422,
                                                                                                      "question_id": 61702768,
                                                                                                          "user_id": 12235281,

这只是将行保持原样（包括逗号）。我试过使用-m键进行Map，但没有真正成功。
将这些值放到各自的列中的正确方法是什么？

cassandra datastax dsbulk

来源：https://stackoverflow.com/questions/61840149/loading-json-data-into-cassandra-using-dsbulk

暂无答案！

目前还没有任何答案，快来回答吧！

我来回答

datastax—使用dsbulk将json数据加载到cassandra中

暂无答案！

相关问题

热门标签

最新问答