Questions tagged [distributed-filesystems]

The tag has no usage guidance.

Filter by
Sorted by
Tagged with
37 votes
1 answer
34k views

Why doesn't SSHFS let me look into a mounted directory?

I use SSHFS to mount a directory on a remote server. There is a user xxx on client and server. UID and GID are identical on both boxes. I use sshfs -o kernel_cache -o auto_cache -o reconnect -o ...
Jan Deinhard's user avatar
  • 2,383
21 votes
4 answers
36k views

Are ZFS clustered filesystems possible?

Is it possible to create a ZFS cluster? Or would you need to go the seemingly ugly (to me at least) route of ZFS with UFS formatted pools governed by GlusterFS? The idea is to see if it is possible ...
SteveMustafa's user avatar
18 votes
3 answers
18k views

GlusterFS vs Ceph, which is better for production use for the moment? [closed]

I am evaluating GlusterFS and Ceph, seems Gluster is FUSE based which means it may be not as fast as Ceph. But looks like Gluster got a very friendly control panel and is ease to use. Ceph was ...
Mickey Shine's user avatar
14 votes
1 answer
12k views

How does NFS read cache work on Debian?

I am planning to use NFS to serve out many small files. They will be read very often so client side caching is crucial. Does NFS handle this? Is there a way to increase the client side caching in some ...
Ztyx's user avatar
  • 1,405
11 votes
9 answers
5k views

Geographically distributed file system with preferred locality

I'm building a application that needs to distribute a standard file server across a few sites over a WAN. Basically, each site needs to write a lot of misc files of varying size (some in the 100s MB ...
dpb's user avatar
  • 445
10 votes
5 answers
5k views

Is there a Distributed SAN/Storage System out there? [closed]

Like many other places, we ask our users not to save files to their local machines. Instead, we encourage that they be put on a file server so that others (with appropriate permissions) can use them ...
Joel Coel's user avatar
  • 13k
7 votes
9 answers
7k views

Distributed file system with local disk cache

My server infrastructure is growing fast and I decided to create a distributed storage cluster. I've been looking for a proper filesystem for this task which meet my requirement, but none of them ...
Galmi's user avatar
  • 121
7 votes
5 answers
7k views

Experience with MooseFS? [closed]

Anyone have any experience using MooseFS? I want an easy distributed storage platform to store static data archive of about 10 TB and serve it to 20-40 nodes. Also I want to be able to add storage ...
brown.2179's user avatar
7 votes
4 answers
1k views

Distributed file systems

I need to implement a distributed storage system for a set of nodes(devices) connected in a mesh network. So what basically my design goals are: The storage system should be capable of handling ...
sud03r's user avatar
  • 191
6 votes
2 answers
664 views

What is reasonable storage failover time that most OS (VM) can tolerate?

I have a GlusterFS 2 node 2 replica setup. I am planning to use it as OpenStack instance store, in which the VM disk image is stored. From my tests, if the GlusterFS node which the hypervisor ...
Pellaeon's user avatar
  • 953
6 votes
0 answers
1k views

How does S3FS (or any other S3 FUSE filsystem) compare to AWS FSx for Lustre + S3

I remember trying s3fs a year back, trying to use some S3 bucket as a FUSE filesystem. I remember it being quite laggy, especially when coupled with git operations on it (an oblivious system architect ...
dimisjim's user avatar
  • 245
6 votes
0 answers
182 views

Fast distributed filesystem for a large amounts of data with metadata in database [closed]

My project uses several processing machines and one storage machine. Currently storage organized with a MSSQL filetable shared folder. Every file in storage have some metadata in database. Processing ...
Vasilly.Prokopyev's user avatar
6 votes
1 answer
19k views

GlusterFS Transport endpoint not connected from time to time

I'm using GlusterFS 3.7.9, currently on a single server with 4 bricks. Each brick has 4TB and the volume is set up as distribute only. The volume is mounted on a secondary server and I use it for ...
Alex Dumitru's user avatar
5 votes
3 answers
5k views

What differences are between a NAS, a shared disk file system on a SAN, and a distributed filesystem?

https://en.wikipedia.org/wiki/Clustered_file_system#Network-attached_storage says Network-attached storage (NAS) provides both storage and a file system, like a shared disk file system on top of a ...
Tim's user avatar
  • 1,497
5 votes
3 answers
553 views

Are we using DFS "wrong"?

We are a company that has branches across the country. We have a minimum of 1 T1 to each branch and a maxiumum of 2 T1s. We have a DFS server at each branch and at our main office. In the past week ...
Jason's user avatar
  • 261
5 votes
2 answers
1k views

HW/SW Design: 2 Petabyte of storage

Disclaimer Yes I'm asking you to design a system for me :) I'm tasked to design a system to store about 10 TB / day with a retention time of 180 days. My first approach would be to go with GlusterFS ...
serverhorror's user avatar
  • 6,488
5 votes
1 answer
3k views

Can you create a glusterfs with existing data in a directory?

I'm looking into convert a single server/comp into the start of a glusterfs distributed system. I already have a directory mounted on this server of 24TB RAID. I want to use this initial computer to ...
Joey's user avatar
  • 151
5 votes
1 answer
4k views

Why is GlusterFS so slow here?

We've set up a mirroring pair of GlusterFS servers. No special tuning, whatever came "out of the box" with GlusterFS-3.5.1 in the official RHEL6 RPM, that's what we have. The cluster works, but the ...
Mikhail T.'s user avatar
  • 2,347
5 votes
1 answer
969 views

Ceph: Why is a greater number of "placement groups" a "bad thing"?

I have been researching distributed databases and file systems, and while I was originally mostly interested in Hadoop/HBase because I'm a Java programmer, I found this very interesting document about ...
monster's user avatar
  • 618
5 votes
1 answer
160 views

deduplicating and indexing directories of images across 150 linux machines

I have a client with 150 Linux servers spread about various cloud services and physical data-centres. Much of this infrastructure is acquired projects/teams and pre-existing servers/installs. The ...
Tom's user avatar
  • 11.2k
5 votes
2 answers
3k views

what distributed file system for a two-node failover setup?

I'm trying to set up a redundant setup consisting of two servers that have everything redundant: the database (MySQL master-master in active/passive mode) the file system (distributed/replicated) ...
Udo G's user avatar
  • 443
5 votes
2 answers
2k views

Is there a way to connect a remote folder with local cache on OSX or Windows workstation?

A company with 50 graphic designers are working on the same file server. They are using mainly Indesign. A typical project is a 60 pages indesign document with 1.5GB of linked files (something like ...
bokan's user avatar
  • 224
4 votes
2 answers
270 views

Distributed storage [closed]

At my University Department we are about to upgrade the computers of our student lab (about 25-30 machines). The machines will be running Linux. One thing about the new machines is that they have ...
nplatis's user avatar
  • 141
4 votes
2 answers
2k views

Linux filesystem or CDN for millions of files with replication

Please tell me solution for this scenario: several millions files, located in one directory ("img/8898f6152a0ecd7997a68631768fb72e9ac2efe1_1.jpg") ~80k file size in average 90% random read access ...
Roman Skvazh's user avatar
4 votes
4 answers
213 views

What should be done with excess computing resources after converting to thin clients?

I work part time for a small private school. The 24 node computer lab kept having hardware failures (mostly drives and cooling fans) so I turned it into a Linux based thin client network. Although the ...
Kenneth Cochran's user avatar
4 votes
2 answers
3k views

Using DFS roots as shared folders

I am curious if I can use DFS roots as shared folders without using any DFS links. Some background: I like the idea of using DFS for name abstraction. By using domain-based namespaces, I can abstract ...
ejel's user avatar
  • 173
4 votes
2 answers
868 views

GlusterFS alternative for file upload website

I have a few file upload websites, with files ranging from hundreds of kilobytes to a few gigabytes. Currently I have all files in a distribute-replicate Gluster volume on a few servers. My ...
Alex Dumitru's user avatar
4 votes
2 answers
712 views

Distributed filesystem across a slow link

I have an image in my head where a link is too slow to realize the real-time transfer of files, but fast enough to catch up every day. What I'd like to see is a master <-> master setup where when I ...
Jeff Ferland's user avatar
  • 20.6k
4 votes
2 answers
3k views

linux distributed file system over WAN advice

I have a fairly simple (not really) requirement but I've looked at a few solutions and can't find a good solution. I've got a Red Hat EL 6 server environment in my co-location and office, and some ...
dmansfield's user avatar
4 votes
3 answers
132 views

How can I tie togeather extra space on Macintosh desktops with a distributed filesystem?

I have access to a bunch of Mac desktops, the hard drives of which are under-utilized. I want to set up a distributed filesystem to gang them together into one large virtual volume. The server has to ...
interfect's user avatar
  • 323
4 votes
5 answers
3k views

Setting up distributed fault tolerant storage at home

I am tired of worrying about data loss at home. My wife is a semi pro photographer, and essentially all of our family memories are digital (and we ought to convert the ones that are not). I am ...
4 votes
1 answer
1k views

Distributed mirrored filesystem under FreeBSD

Can someone share their experience in building a distributed mirrored filesystem between multiple FreeBSD machines? I. e. we have two (three, four...) servers and special partition "part1" mounted on ...
Mikhail Efremov's user avatar
4 votes
1 answer
3k views

In a distributed filesystem like MooseFS or XtreemFS, should individual nodes expose "raw-er" storage, or LVM'd storage?

When preparing an infrastructure to utilize a distributed storage system like MooseFS or XtreemFS, how should individual nodes present storage to the rest of the environment? Is it better for ...
warren's user avatar
  • 18.6k
4 votes
1 answer
721 views

Recommendations for distributed processing/distributed storage systems

At my organization we have a processing and storage system spread across two dozen linux machines that handles over a petabyte of data. The system right now is very ad-hoc; processing automation and ...
Eddie's user avatar
  • 323
4 votes
1 answer
711 views

Distributed File Systems

So, I've been reading several articles around ServerFault as well as google. (For Example, this link) My Requirements are very similar to the link above, however i'd like to also have dynamic or at ...
grufftech's user avatar
  • 6,840
4 votes
1 answer
474 views

What are the functionalities of Distributed File systems and Distributed Storage Systems?

i'm reading cloud vendors solutions for the distributed storage systems such as Amazon Dynamo and Google Big Table. and really confused in two terms : what is Distrubuted file systems, Do cloud ...
Berkay's user avatar
  • 431
3 votes
3 answers
1k views

failsafe RAM drive solution [closed]

I'm looking for a production solution to create a RAM drive that will be safely synchronized with HDD. I have a piece of custom software with heavy I/O load (this is some kind of proprietary document-...
user1450663's user avatar
3 votes
1 answer
2k views

Setting up a Rails 2.3.x app on EC2 for easy scalability

I'm running a simple rails stack on a single dedicated machine. We're reaching our full capacity and have absolutely no setup for scaling, just one app on one machine. I did some research and came up ...
Max Chernyak's user avatar
3 votes
1 answer
1k views

Distributed filesystem for automated offline data mirroring [closed]

I'd like to achieve the following setup: Every time I connect my laptop to a local network, my partition gets automatically mirrored to a partition on my local server. I only want to mirror what has ...
Petr's user avatar
  • 613
3 votes
4 answers
972 views

Server setup for image storage

I need to store 25M Photos in 4 sizes = total 100M Files, the filesize will vary between 3Kb and 200 kb per file and the used storage at beginning is about 14-15 TB. Our goal is to have the data on 2-...
Nenad's user avatar
  • 375
3 votes
2 answers
12k views

Umount stale glusterfs partition

I am using glusterfs on several Ubuntu servers: two of them are running glusterfs servers in replication mode. Without any clear error, the glusterfs partition became stale and the system shows this ...
Khaled's user avatar
  • 36.7k
3 votes
2 answers
2k views

How backup a distributed file system?

Note: This is a "theoretical" question, as I haven't got that kind of data yet. If you have a distributed file system spanning a dozen or more servers, and TBs of data, how do you perform backups of ...
monster's user avatar
  • 618
3 votes
1 answer
158 views

Is distributed storage on an internal network possible?

I've been thinking about all the workstations I have on my shop floor (about 50) and the wasted drive space each of them has. For instance, my machines only use about 30G to 40G of local storage, yet ...
Albion's user avatar
  • 465
3 votes
1 answer
2k views

GlusterFS vs Ceph, which is better for production use in 2012? [closed]

This is the same question that was asked here, but it's been almost two years since. Meanwhile Ceph has seen constant development (361 kernel commits) and btrfs, in my opinion, is on the verge of ...
al.'s user avatar
  • 925
3 votes
2 answers
596 views

Distributed, Parallel, Fault-tolerant File System with high throughput

I am looking for DFS (distributed file system) that is fault tolerant and easy to maintain. I will have tons (100M+) of small files (from 1K to 500K). Files will be located in some directories what ...
Worker's user avatar
  • 647
3 votes
1 answer
394 views

Does Perkeep (camlistore) have built in protection from bitrot?

Does Perkeep (PKA Camlistore) offer protection from silent corruption (eg. bitrot) of the data in its current design like what is offered in ZFS. If it does, how well does it fare when compared to ZFS?...
Timothy C. Quinn's user avatar
3 votes
3 answers
808 views

Choice of distributed filesystem for intensive reads and writes

I have a series of servers (HP ProLiant, 34 servers) each of which with 500 G of hard drive space. These servers are part of a computational cluster which runs processes that roughly fall into two "...
Einar's user avatar
  • 225
3 votes
1 answer
3k views

How to install ceph on EC2 Amazon Linux AMI

I want to test Ceph (a distributed network storage and file system) on some EC2 hosts which is derived from Amazon Linux AMI (amzn-ami-2011.09.2.x86_64-ebs). The kernel version is 3.2 and btrfs is ...
takaomag's user avatar
  • 261
3 votes
0 answers
3k views

NFS showing files in directory, but can't open or stat

I'm using a network of Linux (Debian Squeeze on kernel 2.6.32) machines, sharing files using NFS (v3). The scenario is that a process running on client A will create a file through NFS on file server ...
user79126's user avatar
  • 469
3 votes
0 answers
808 views

Are snapshots and clones filesystem-wide in ZFS-backed Lustre clusters

My goal is to find a distributed filesystem on Linux that supports ZFS-like lightweight snapshots and snapshot clones. This StackOverflow question expresses what I'm looking for pretty well. I'm ...
Anand's user avatar
  • 31