Hi, I requested my data from Reddit and got 16 MB of CSVs, which is a considerable amount. Does anyone know of a tool to process, visualize, or search the data? I assume the format is the same for everyone, so maybe someone has already built something like that.

EDIT: the problem is not performance; with files <5 MB I can search with Notepad++ in milliseconds. What I’m looking for is a user-friendly interface (ideally with thumbnail images, links and such).

The problem with searching for “reddit export data visualizer” is that Google shows Reddit posts about visualizing generic data.

Thanks.

  • Crul@lemm.ee (OP)
    1 year ago

    The links you posted are weird:

    • https://pixeldrain.com/u/KfgV7bqn: It offers to download a file named “Antimutt in r-Excel ultra.paq8o”, and I have no idea what it is for.

    • https://the-eye.eu/redarcs: It says “This Reddit Community Has Been Archived”

    • Antimutt@lemmy.world
      1 year ago

      The first is the result of extracting every line containing my nick from the CSV, stored with the best compression around. The second is where to get the CSV, and a lot of communities have been archived there, like it says.
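
      For anyone wanting to do a similar extraction on their own export, here is a minimal Python sketch. It is not necessarily how the file above was produced: the function name `rows_mentioning`, the column names, and the sample rows are all made up for illustration, and a real export may be laid out differently.

```python
import csv
import io


def rows_mentioning(csv_text, nick):
    """Return the CSV rows in which any field contains `nick` (case-insensitive)."""
    needle = nick.lower()
    reader = csv.reader(io.StringIO(csv_text))
    return [row for row in reader if any(needle in field.lower() for field in row)]


# Hypothetical sample data; the real export columns may differ.
sample = "author,body\nAntimutt,Try Power Query\nCrul,Thanks for the input\n"
matches = rows_mentioning(sample, "antimutt")
```

      For a real file, the same function could be fed the file contents read with `open(path, newline="", encoding="utf-8").read()`; at 16 MB that fits comfortably in memory.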

      • Crul@lemm.ee (OP)
        1 year ago

        Just to confirm I understand: you are talking about Power Query vs. Power BI for dealing with huge datasets, right?

        Because, in my case, with 16 MB, I don’t see the need for anything especially powerful. My problem is not performance, but convenience.

        Thanks for the input.