Questions tagged [distributed-filesystems]
The distributed-filesystems tag has no usage guidance.
128
questions
37
votes
1
answer
34k
views
Why doesn't SSHFS let me look into a mounted directory?
I use SSHFS to mount a directory on a remote server. There is a user xxx on client and server. UID and GID are identical on both boxes.
I use
sshfs -o kernel_cache -o auto_cache -o reconnect -o ...
21
votes
4
answers
36k
views
Are ZFS clustered filesystems possible?
Is it possible to create a ZFS cluster? Or would you need to go the seemingly ugly (to me at least) route of ZFS with UFS formatted pools governed by GlusterFS?
The idea is to see if it is possible ...
18
votes
3
answers
18k
views
GlusterFS vs Ceph, which is better for production use for the moment? [closed]
I am evaluating GlusterFS and Ceph, seems Gluster is FUSE based which means it may be not as fast as Ceph. But looks like Gluster got a very friendly control panel and is ease to use.
Ceph was ...
14
votes
1
answer
12k
views
How does NFS read cache work on Debian?
I am planning to use NFS to serve out many small files. They will be read very often so client side caching is crucial. Does NFS handle this? Is there a way to increase the client side caching in some ...
11
votes
9
answers
5k
views
Geographically distributed file system with preferred locality
I'm building a application that needs to distribute a standard file server across a few sites over a WAN. Basically, each site needs to write a lot of misc files of varying size (some in the 100s MB ...
10
votes
5
answers
5k
views
Is there a Distributed SAN/Storage System out there? [closed]
Like many other places, we ask our users not to save files to their local machines. Instead, we encourage that they be put on a file server so that others (with appropriate permissions) can use them ...
7
votes
9
answers
7k
views
Distributed file system with local disk cache
My server infrastructure is growing fast and I decided to create a distributed storage cluster. I've been looking for a proper filesystem for this task which meet my requirement, but none of them ...
7
votes
5
answers
7k
views
Experience with MooseFS? [closed]
Anyone have any experience using MooseFS? I want an easy distributed storage platform to store static data archive of about 10 TB and serve it to 20-40 nodes. Also I want to be able to add storage ...
7
votes
4
answers
1k
views
Distributed file systems
I need to implement a distributed storage system for a set of nodes(devices) connected in a mesh network.
So what basically my design goals are:
The storage system should be capable of handling ...
6
votes
2
answers
664
views
What is reasonable storage failover time that most OS (VM) can tolerate?
I have a GlusterFS 2 node 2 replica setup. I am planning to use it as OpenStack instance store, in which the VM disk image is stored.
From my tests, if the GlusterFS node which the hypervisor ...
6
votes
0
answers
1k
views
How does S3FS (or any other S3 FUSE filsystem) compare to AWS FSx for Lustre + S3
I remember trying s3fs a year back, trying to use some S3 bucket as a FUSE filesystem. I remember it being quite laggy, especially when coupled with git operations on it (an oblivious system architect ...
6
votes
0
answers
182
views
Fast distributed filesystem for a large amounts of data with metadata in database [closed]
My project uses several processing machines and one storage machine. Currently storage organized with a MSSQL filetable shared folder. Every file in storage have some metadata in database.
Processing ...
6
votes
1
answer
19k
views
GlusterFS Transport endpoint not connected from time to time
I'm using GlusterFS 3.7.9, currently on a single server with 4 bricks.
Each brick has 4TB and the volume is set up as distribute only.
The volume is mounted on a secondary server and I use it for ...
5
votes
3
answers
5k
views
What differences are between a NAS, a shared disk file system on a SAN, and a distributed filesystem?
https://en.wikipedia.org/wiki/Clustered_file_system#Network-attached_storage says
Network-attached storage (NAS) provides both storage and a file
system, like a shared disk file system on top of a ...
5
votes
3
answers
553
views
Are we using DFS "wrong"?
We are a company that has branches across the country. We have a minimum of 1 T1 to each branch and a maxiumum of 2 T1s. We have a DFS server at each branch and at our main office. In the past week ...
5
votes
2
answers
1k
views
HW/SW Design: 2 Petabyte of storage
Disclaimer Yes I'm asking you to design a system for me :)
I'm tasked to design a system to store about 10 TB / day with a retention time of 180 days.
My first approach would be to go with GlusterFS ...
5
votes
1
answer
3k
views
Can you create a glusterfs with existing data in a directory?
I'm looking into convert a single server/comp into the start of a glusterfs distributed system. I already have a directory mounted on this server of 24TB RAID. I want to use this initial computer to ...
5
votes
1
answer
4k
views
Why is GlusterFS so slow here?
We've set up a mirroring pair of GlusterFS servers. No special tuning, whatever came "out of the box" with GlusterFS-3.5.1 in the official RHEL6 RPM, that's what we have.
The cluster works, but the ...
5
votes
1
answer
969
views
Ceph: Why is a greater number of "placement groups" a "bad thing"?
I have been researching distributed databases and file systems, and while I was originally mostly interested in Hadoop/HBase because I'm a Java programmer, I found this very interesting document about ...
5
votes
1
answer
160
views
deduplicating and indexing directories of images across 150 linux machines
I have a client with 150 Linux servers spread about various cloud services and physical data-centres. Much of this infrastructure is acquired projects/teams and pre-existing servers/installs.
The ...
5
votes
2
answers
3k
views
what distributed file system for a two-node failover setup?
I'm trying to set up a redundant setup consisting of two servers that have everything redundant:
the database (MySQL master-master in active/passive mode)
the file system (distributed/replicated)
...
5
votes
2
answers
2k
views
Is there a way to connect a remote folder with local cache on OSX or Windows workstation?
A company with 50 graphic designers are working on the same file server. They are using mainly Indesign.
A typical project is a 60 pages indesign document with 1.5GB of linked files (something like ...
4
votes
2
answers
270
views
Distributed storage [closed]
At my University Department we are about to upgrade the computers of our student lab (about 25-30 machines). The machines will be running Linux.
One thing about the new machines is that they have ...
4
votes
2
answers
2k
views
Linux filesystem or CDN for millions of files with replication
Please tell me solution for this scenario:
several millions files, located in one directory ("img/8898f6152a0ecd7997a68631768fb72e9ac2efe1_1.jpg")
~80k file size in average
90% random read access
...
4
votes
4
answers
213
views
What should be done with excess computing resources after converting to thin clients?
I work part time for a small private school. The 24 node computer lab kept having hardware failures (mostly drives and cooling fans) so I turned it into a Linux based thin client network. Although the ...
4
votes
2
answers
3k
views
Using DFS roots as shared folders
I am curious if I can use DFS roots as shared folders without using any DFS links.
Some background: I like the idea of using DFS for name abstraction. By using domain-based namespaces, I can abstract ...
4
votes
2
answers
868
views
GlusterFS alternative for file upload website
I have a few file upload websites, with files ranging from hundreds of kilobytes to a few gigabytes.
Currently I have all files in a distribute-replicate Gluster volume on a few servers.
My ...
4
votes
2
answers
712
views
Distributed filesystem across a slow link
I have an image in my head where a link is too slow to realize the real-time transfer of files, but fast enough to catch up every day. What I'd like to see is a master <-> master setup where when I ...
4
votes
2
answers
3k
views
linux distributed file system over WAN advice
I have a fairly simple (not really) requirement but I've looked at a few solutions and can't find a good solution. I've got a Red Hat EL 6 server environment in my co-location and office, and some ...
4
votes
3
answers
132
views
How can I tie togeather extra space on Macintosh desktops with a distributed filesystem?
I have access to a bunch of Mac desktops, the hard drives of which are under-utilized. I want to set up a distributed filesystem to gang them together into one large virtual volume. The server has to ...
4
votes
5
answers
3k
views
Setting up distributed fault tolerant storage at home
I am tired of worrying about data loss at home. My wife is a semi pro photographer, and essentially all of our family memories are digital (and we ought to convert the ones that are not). I am ...
4
votes
1
answer
1k
views
Distributed mirrored filesystem under FreeBSD
Can someone share their experience in building a distributed mirrored filesystem between multiple FreeBSD machines? I. e. we have two (three, four...) servers and special partition "part1" mounted on ...
4
votes
1
answer
3k
views
In a distributed filesystem like MooseFS or XtreemFS, should individual nodes expose "raw-er" storage, or LVM'd storage?
When preparing an infrastructure to utilize a distributed storage system like MooseFS or XtreemFS, how should individual nodes present storage to the rest of the environment?
Is it better for ...
4
votes
1
answer
721
views
Recommendations for distributed processing/distributed storage systems
At my organization we have a processing and storage system spread across two dozen linux machines that handles over a petabyte of data. The system right now is very ad-hoc; processing automation and ...
4
votes
1
answer
711
views
Distributed File Systems
So, I've been reading several articles around ServerFault as well as google. (For Example, this link)
My Requirements are very similar to the link above, however i'd like to also have dynamic or at ...
4
votes
1
answer
474
views
What are the functionalities of Distributed File systems and Distributed Storage Systems?
i'm reading cloud vendors solutions for the distributed storage systems such as Amazon Dynamo and Google Big Table.
and really confused in two terms :
what is Distrubuted file systems, Do cloud ...
3
votes
3
answers
1k
views
failsafe RAM drive solution [closed]
I'm looking for a production solution to create a RAM drive that will be safely synchronized with HDD.
I have a piece of custom software with heavy I/O load (this is some kind of proprietary document-...
3
votes
1
answer
2k
views
Setting up a Rails 2.3.x app on EC2 for easy scalability
I'm running a simple rails stack on a single dedicated machine. We're reaching our full capacity and have absolutely no setup for scaling, just one app on one machine. I did some research and came up ...
3
votes
1
answer
1k
views
Distributed filesystem for automated offline data mirroring [closed]
I'd like to achieve the following setup:
Every time I connect my laptop to a local network, my partition gets automatically mirrored to a partition on my local server.
I only want to mirror what has ...
3
votes
4
answers
972
views
Server setup for image storage
I need to store 25M Photos in 4 sizes = total 100M Files, the filesize will vary between 3Kb and 200 kb per file and the used storage at beginning is about 14-15 TB.
Our goal is to have the data on 2-...
3
votes
2
answers
12k
views
Umount stale glusterfs partition
I am using glusterfs on several Ubuntu servers: two of them are running glusterfs servers in replication mode.
Without any clear error, the glusterfs partition became stale and the system shows this ...
3
votes
2
answers
2k
views
How backup a distributed file system?
Note: This is a "theoretical" question, as I haven't got that kind of data yet.
If you have a distributed file system spanning a dozen or more servers, and TBs of data, how do you perform backups of ...
3
votes
1
answer
158
views
Is distributed storage on an internal network possible?
I've been thinking about all the workstations I have on my shop floor (about 50) and the wasted drive space each of them has. For instance, my machines only use about 30G to 40G of local storage, yet ...
3
votes
1
answer
2k
views
GlusterFS vs Ceph, which is better for production use in 2012? [closed]
This is the same question that was asked here, but it's been almost two years since.
Meanwhile Ceph has seen constant development (361 kernel commits) and btrfs, in my opinion, is on the verge of ...
3
votes
2
answers
596
views
Distributed, Parallel, Fault-tolerant File System with high throughput
I am looking for DFS (distributed file system) that is fault tolerant and easy to maintain. I will have tons (100M+) of small files (from 1K to 500K). Files will be located in some directories what ...
3
votes
1
answer
394
views
Does Perkeep (camlistore) have built in protection from bitrot?
Does Perkeep (PKA Camlistore) offer protection from silent corruption (eg. bitrot) of the data in its current design like what is offered in ZFS. If it does, how well does it fare when compared to ZFS?...
3
votes
3
answers
808
views
Choice of distributed filesystem for intensive reads and writes
I have a series of servers (HP ProLiant, 34 servers) each of which with 500 G of hard drive space. These servers are part of a computational cluster which runs processes that roughly fall into two "...
3
votes
1
answer
3k
views
How to install ceph on EC2 Amazon Linux AMI
I want to test Ceph (a distributed network storage and file system) on some EC2 hosts which is derived from Amazon Linux AMI (amzn-ami-2011.09.2.x86_64-ebs).
The kernel version is 3.2 and btrfs is ...
3
votes
0
answers
3k
views
NFS showing files in directory, but can't open or stat
I'm using a network of Linux (Debian Squeeze on kernel 2.6.32) machines, sharing files using NFS (v3). The scenario is that a process running on client A will create a file through NFS on file server ...
3
votes
0
answers
808
views
Are snapshots and clones filesystem-wide in ZFS-backed Lustre clusters
My goal is to find a distributed filesystem on Linux that supports ZFS-like lightweight snapshots and snapshot clones. This StackOverflow question expresses what I'm looking for pretty well. I'm ...